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The Fluids Engineering Laboratory 
at the University of Michigan 


Although the purpose of this paper is specifically that of describing the Fluids Engineer- 
ing Laboratory at the University of Michigan, it is clear that an understanding of the 
faculty philosophy underlying the planning of this unit is a necessary part of the docu- 
ment. This is an age of technological expansion, the rate-of-change of which exceeds 
any progress the world has hitherto known. The seeking of new knowledge and the 
application of that knowledge is the work of an ever-increasing number of competent 
scientists and engineers not only skilled in a technology but equally successful in the 
art of human understanding and relationships. Institutions of higher education are 
an integral part of this technological age and, because of this fact, find themselves con- 
fronted with the task of foreseeing the educational and research needs of the future. 
This is an immense task, one which is occupying many of the best minds of this genera- 
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tion. 


The technological and social progress of the coming generation depends upon the 
continuing flow of students emanating from our institutions of learning. 


They must 


be adequately prepared for the responsibilities which they alone can assume. 


Faculty Planning 


T IS THE PURPOSE of any good engineering faculty to 
plan programs in education which will best prepare the student 
for his chosen career. The best way as conceived by one group 
may be different than the best way as conceived by another group, 
the reason being that the process, in final analysis, involves 
people, each with special talents and an individual philosophy of 
life. The programs are, therefore, a blending of those talents and 
philosophies to a common purpose. 

Certainly, the first requirement of any faculty is that of pro- 
viding an ethical and social environment in which the student 
may grow morally and socially into an individual acceptable to 
his fellow student and to those with whom he will one day par- 
ticipate in his technical career. Beyond this, any cducational 
program should be directed toward extending the student’s in- 
herent creative and analytical ability. A sound, logical, thought 
process and a background of fundamental information are es- 
sential to the outstanding performance of the professional man in 
any field of endeavor. 

Examination of past and present curriculums and an intelligent 
evaluation of changing needs of a profession would appear to be 
reasonable criteria upon which to proceed in the planning of both 
educational programs and physical facilities. Studies attendant 
to curriculum changes never cease in an engineering college. An 
alert faculty is never satisfied with its past performance. Plan- 
ning for a physical facility, however, cannot change continuously 
since a decision to proceed with construction is mandatory. It 
behooves a faculty group to plan well and adequately for a future 
need that is difficult to decide upon in this technological age. 

The Fluids Engineering Laboratory is the result of faculty 
planning which resulted in the publication of “Faculty Require- 
ments.”’ The “Faculty Requirements” is a written document 
specifically designed for the guidance of the architects and build- 
ing designers. It included all the detailed information developed 
by the faculty concerning functional arrangement and relation, 
capacity, distribution, special facilities and areas, special re- 

Contributed by the Hydraulic Division and presented at the Hy- 
draulic Conference, Ann Arbor, Mich., April 13-15, 1959, of Tue 
AMERICAN Society OF MECHANICAL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 
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quirements, and occupancy. These plans of the faculty have been 
wholeheartedly supported by the University Administration, the 
Board of Regents of the University, and the Governor and Legisla- 
ture of the State of Michigan. When completed, the State can 
boast of one of the most complete educational laboratories in the 
nation devoted to the study of the behavior of fluids, one of the 
very common forms of substances with which the engineer must 
deal. 

In order to understand the reasons for some of the faculty de- 
tails, it is considered essential that a background be provided. 
Adequate laboratory facilities have been a real concern in the 
College of Engineering at the University of Michigan. Adequate 
total space required to train the numbers of students properly and 
also relief from obsolescence are matters of utmost concern. In 
any facilities program, it becomes necessary to have established the 
use to which any one facility will be put; it is important to know 
what the faculty visualize in laboratory activity not only for the 
immediate future but the rather far-distant future; and it is re- 
quired that the space be developed in order that the cost of build- 
ing and the operating budgets be kept to a minimum. 

An increased activity in engineering college laboratories is to 
be expected in all advancing areas of technological progress; how- 
ever, laboratory activity usually lags behind in the educational 
program. Because of this apparent fact, it becomes necessary to 
study the historical as well as the future needs for space. 

Engineering laboratories are areas in which the research workers 
and students experience the application of sciences to engineering 
processes or to specialized technological developments. In the 
case of the undergraduate student, his laboratory experience is 
largely confined to his learning the physical phenomena predicted 
by the basic principles and fundamentals of the profession. The 
graduate student and the faculty laboratory activity is directed 
toward the extension of the fundamental knowledge in the several 
technological areas of development through advanced application 
of the sciences and mathematics. The space that has heretofore 
been occupied by laboratories reflects the two functions and is 
obsolete when it no longer serves both. Complete obsolescence is 
a fact in many educational institutions. 

Early examination of space needs as projected by the several 
professional departments at the University of Michigan indicated 
that each had certain laboratory interests which could be satisfied 
in a common physical plant where height, width, floor loading, 
and utilities supply were universal needs for their separate labora- 
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tory activity. Further, particularly for the undergraduate teach- 
ing function, it was immediately apparent that common physical 
equipment with minor variation frequently satisfied the need of 
several departments. Although it was obviously impossible be- 
cause of the weight of numbers to do away with duplication of 
equipment, certain economies of space and instrumentation, and 
higher use factor of a kind of equipment appeared possible. The 
possibility of providing high-capacity utilities, which might be- 
come the requirement of any group, was enhanced by giving 
consideration to distribution of common utilities to all groups 
within one facility. Finally, it is expected that the entire level of 
teaching and research in the very important subject, “Behavior of 
Fluids,”’ will be upgraded through the combined effort, experience, 
and talents of well-qualified men from the several departments. 
The Faculty Study Committee adopted a liberal definition of a 
fluid, thereby extending its interests not only from the liquid to 
gaseous phases of substances, but also into the area of solid sus- 
pensions in gases and liquids. In those fluid-study areas where 
special facility requirements were necessary, for instance, com- 
bustion studies involving more than normal ventilation require- 
ments and reaction studies where blowout protection for per- 
sonnel is necessary, it was agreed that a building separate from a 
general education and research unit should be considered. 


It soon became apparent that all faculty groups looked toward 
high-capacity utilities in different combinations to satisfy their 
projected research thinking and planning. For instance, Civil 
Engineering required high water quantities under closely con- 
trolled conditions; Chemical Engineering required high water 
quantities and high quantities of steam, and a modest amount of 
electric power; Mechanical Engineering required modest amounts 
of steam, large electric power distribution, and high quantities of 
water; Engineering Mechanics required a modest amount of 
water, carefully controlled, and electric power; Aeronautical En- 
gineering required electric power and ample surrounding space of 
the wind-tunnel area. To satisfy the quantity and the varying 
utility demands, it was necessary that some of the utilities be 
generally distributed, and in other cases that a grouping be con- 
sidered so that the departmental relationship in the laboratory 
could be maintained. 

The utility distribution costs soon indicated that certain types 
of space needs, namely, classrooms, offices, and small laboratories, 
likewise sought by the faculty, should be planned in a building 
other than the Fluids Engineering Laboratory, since the usage of 
that space did not indicate the need for high-capacity utilities. 
If, however, this space were provided in the same building, it 
would be traversed by an unused high-capacity distribution of 
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those utilities. The cost of distribution of the utilities in the 
Fluids Engineering Laboratory exceeds the cost of distribution of 
utilities required by a classroom and office-type of building by 
approximately four times. The difference in the cost of utilities 
for the Fluids Engineering Laboratory and a comparable size 
of classroom building will provide approximately 2'/, per 
cent more space of the classroom type in a total project of 
approximately 800,000 sq ft. This building program is con- 
sidered minimum by the Engineering College at the University 
of Michigan for the future numbers of students who will seek ad- 
mission. 


The Fluids Engineering Laboratory 


When the Fluids Engineering Laboratory project is com- 
pleted, it will provide 120,000 sq ft of laboratory, teaching, and 
research space. The first portion of the project which provides 
all the basic utility arrangement is completed. Approximately 
52,000 sq ft of laboratory is available in this portion of the 
project. The complete building is divided into two types of con- 
struction. The main portion of the building which is 408 ft long 
128 ft wide, is the usual column-type arrangement: 
Seventeen bays long and five bays wide and two stories high ex- 
cept in one small portion of the building which is three stories high. 
The annex, which will be completed with the second portion of 
the construction, is 230 ft long by 84 ft wide, built of rigid frame 
construction and one story high. 


and 


The building basically pro- 
vides both a large area where normal building column inter- 
ferences are encountered and a smaller area, the Annex, which is 
free from vertical columns within the space. 

The large portion of the laboratory is oriented east and west 
with reference to its long dimension and located so that access 
to either the first or second floor from ground level can be had by 
foot or vehicle. This flexibility was accomplished by taking ad- 
vantage of some natural topography. Figs. 1 and 2 are photo- 
graphs of the laboratory model as viewed from either side of the 
main building and indicate the accessibility to the first and 
second floor. The rigid frame portion of the building, the Annex, 
is adjacent to the east end of the larger portion of the building and 
is oriented north and south. Access to this portion of the building 
is gained at the level of the first floor of the main building from 
either the south or east side. 

Building column spacing and ceiling heights were of special 
concern to the faculty planning committee. Except for the 
transverse spacing of the building columns in the center of the 
structure, all columns are spaced on 24-ft centers. The center 
transverse spacing is 32 ft to provide a clearance for the fluid 
sump. The net clearance underneath the floor joist on the first 
and partial third floor is 15 ft; the clearance under the roof joist 


on the second floor is 12 ft. The net maximum clearance under 


the rigid frame of the Annex portion of the laboratory is 27 ft. 
Fig. 3 is a longitudinal section through the main portion of the 
building which will show the foregoing relationship. 

The load capacity of the first floor of the main building and that 
of the Annex is 400 psf. The load capacity of the second and par- 
tial third floor of the main building is 220 psf, respectively. The 
capacity of the first floor was dictated by the probable load that 
equipment might impose, the second and third-floor capacity was 
dictated by the concentrated wheel loading of the fork-lift truck 
which is the materials-and-equipment-hand]ing device for the 
complete laboratory. All floors are hard-surface concrete; the 
office areas are asphalt tiled. 

Figs. 4 and 5 show the general plan of the first and second floors, 
respectively. In order to be able to make installations that ex- 
ceed the height of the ceiling of the first floor, a 4-ft « 4-ft man- 
hole, mounted flush with the second-floor surface, the periphery 
of which is enclosed by an open drain trench to the sewer, has 
been provided in the center of each bay. The manhole covers 
afford the same load capacity as the second floor thereby avoiding 
any hazard of break-through. The hard-surfaced floors are 
trowelled to provide a '/: in. to 10 ft slope to drain from each of 
the four columns at any bay. In addition to the manholes which 
could be used for utility connection, four 4-in. flush floor sleeves 
at the corners of each building column are provided for further 
utility distribution and communication as may be needed be- 
tween floors. In the area under the partial third floor, approxi- 
mately 30 per cent of the floor area is provided with removable 
plate sections in order that a total height of installation of ap- 
proximately 45 ft and a maximum of 4 ft in diameter can be ac- 
commodated. In a small area of the first floor, the depth of the 
pump pit No. 6 can be used to gain an additional height of in- 
stallation to a total of approximately 56 ft. 

A high-load-capacity concrete pad 100 ft wide and 72 ft long 
is provided immediately outside of the high bay area, and on the 
north side of the main building, for installation of outside-column 
installations. In order that connections of utilities and instru- 
mentation from within the building to these outside installations 
can be made, a series of 1-ft X-1-ft portholes are provided in 
the building wall. In addition to convenience, these portholes 
avoid the necessity of breaking the vapor barrier which is an in- 
tegral part of the building walls. 

The building walls are made up of 8 in. of brick, 2-in. foam-glass 
insulation, a membrane vapor barrier, and a 4-in. compressed- 
block inner wall. The net glass area in the building is less than 20 
per cent. All glass is of the double-pane, sealed construction. 
This construction was specifically directed at the problem of con- 
densation. The interior temperature of the laboratory walls is 
calculated to be above the dew point on the coldest days of the 
year. 


awe 


Fig. 3 Longitudinal building section, section viewed from south side 
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Building Utilities 


Utilities for the building are generally subdivided by func- 
tion; those that service the building generally and those that are 
specifically sized and distributed for power equipment used in re- 
search and teaching functions. 

Service utilities such as telephone outlets, city water for drink- 
ing fountains and rest rooms, 117-volt electric distribution, and 
like services are generally distributed. Four-gang, 30-amp elec- 
tric receptacles connected to a four-wire, 208-volt distribution af- 
fording 117 volts to the neutral wire are placed at each building 
column. This circuit is separately supplied in order to eliminate 
voltage fluctuations that might be caused by the starting of a 
heavy motor on a power circuit. Each outlet is separately 
fused, thus providing a total of 120 amp at each column. Welder 
outlets for 480-volt, 3-phase are provided at every second 
column. 

The heavier utilities as required for teaching and research 
(water, gas, electric, and steam) are distributed in a series of 
supply loops all hanging from the ceiling of the first floor of the 
main building. 

All the utility services including gas and air supply are available 
immediately below and to the sides of the manhole openings from 
first to second floor. This arrangement is shown in Fig. 6(a). 
Except for the head-tank system, it is possible to bring a maximum 
utility capacity up to the second floor at any bay in the building. 
In addition to the manhole opening, four 4-in. floor sleeves are 
arranged at each building column for the purpose of utility dis- 
tribution. shown in Fig. 6(b). All utilities are 
flexibly arranged in order that ready connections can be made to 
supply research and teaching equipment on either the first, sec- 
ond, or third floor. The supply loops run longitudinally, the 
length of the loop system being so located laterally that a mini- 
mum length of connection can be accomplished from any loca- 
tion in the building. 

There are three water systems in the building, two of which are 
supplied from a sump 192 ft long, 12 ft wide, and 10 ft deep in the 
shallow end, and a third supplied from city water supply. Ad- 
ditives to reduce rust and scaling are used in the sump-connected 


These are 


systems. One of the water systems is controlled by a head tank 
located next to the ceiling of the partial third floor approximately 
42 ft above the level of the first floor. The head tank is so de- 
signed that it provides approximately 140 ft of weir crest ad- 
justed to an accuracy of '/\» in. out of parallel and perpendicular 
to gravity lines of the earth. The outlet of the head tank is 
flexibly connected to a 48-in. standpipe extending down into 
pump pit No. 6. Two supply systems, 8 and 12 in., respectively, 
are connected to the standpipe and so valved that either or both 
systems may operate simultaneously. Since only a regulating 
quantity of water spills over the head-tank weir, the flow of water 
to the system is progressively from sump to standpipe to system 
and return to sump. Flanged outlets from the 48-in. standpipe 
are provided at both the floor and the ceiling levels at each floor. 
Venturi meters, which can be calibrated against weigh tanks, are 
arranged in the 12 and 8-in. systems to measure a maximum flow 
of 7 and 3 efs, respectively. The 12-in. system is arranged for a 
flanged outlet at every second bay; the 8-in. system is arranged 
with two flanged outlets at every bay 

The second water-supply system from the sump is pressurized 
to operate in conjunction with a cooling tower. This system will 
deliver 3 cfs at the farthest high point in the building at 60 psi. 
The pressurized return can be directed through the cooling tower 
or bypassed directly to the supply sump, depending entirely on 
the amount of heat absorbed by the water circulating in the 
system. The system supply is intended for the many heat-trans- 
fer functions required in the laboratory. Outlets at every bay, all 
of which are provided with vertically upward mounted tee con- 
nections and flanged caps, are provided from this system. The 
upward mounted tees are developed to prevent fouling of valves 
in the supplies to equipment. The cooling towers are designed 
for a maximum capacity of 30,000,000 Btu per hr. 

Both systems are sectionalized in a way that a portion of the 
system may be cut off for additional connections without shutting 
down other research activity. Return to sump from either system 
may be accomplished through transverse drain trenches in the 
first floor or through a pressurized return. 

The weigh tanks, each having a capacity of approximately 
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48,000 lb, are located at the end and immediately over the water 
sump at the end of sump farthest from the 48-in. standpipe. The 
weigh tanks are provided with an air-operated diverter in order 
that calibration quantities exceeding the capacity of one weigh 
tank may be considered. The tanks may be used to calibrate 
water quantities from any of the supply systems. 

Three-phase electric power at 480 volts is distributed parallel to 
the fluid supply loops in 600-amp, connected-bus ducts. The bus 
ducts, as the fluid systems, are hanging from the ceiling of the first 
floor. A connection at 1-ft intervals through fused plug-ins can 
be made to supply either the first, second, or third floor. The 
electric connection between the fused plug-in and the equipment 
requiring power is largely through flexible rubber-covered drop 
cable rather than by conduit. Conduit is used in preference to 
rubber-covered cable only where fixed installations are considered. 
A 4800-volt circuit is installed in the laboratory for possible future 
high demand. This circuit is not immediately available through 
use of plug-in boxes. Special wiring provision from junction 
boxes will be required to use this circuit. 

General distribution of d-c electricity has not been included. 
Initially, four d-c dynamometers ranging in capacity from 5 to 
200 hp have been installed; each, however, supplied by its own 
motor-generator located in the immediate vicinity of the dy- 
namometer. Where other d-c power is required a rectifier will be 
installed locally; for instance, 86 kw of 24-volt d-c power is in- 
stalled in the immediate vicinity of the area where heat-transfer 
activities are proceeding. 


Laboratory Installations 


The Fluids Engineering Laboratory has been planned to pro- 
vide a maximum degree of flexibility for the present and future 
use of the building. Only those partitions enclosing activities 
which were likely to be continued, regardless of a possible future 
requirement of our technology, were included. Such enclosures 
included a minimum number of offices, computing rooms, a 
multiphase laboratory, instrument storage rooms, rest rooms, 
stair wells, photographic rooms, and janitor closets. All other re- 
quired enclosures will be accomplished using minimum-cost cur- 
tain walls, which can be removed and rebuilt as an activity ex- 
pands or is reduced. In all such changes, the teaching or research 
activity has available to it the same capacity of utilities as was 
provided in the space it formerly occupied. 

The laboratory includes a minimum number of fixed installa- 
tions, all of which are of interest to a number of the professional 
groups. The water weigh tanks described earlier are typical of 
such installations. 

The multiphase laboratory has been partitioned in order that 
a pressure slightly lower than ambient may be maintained. It is 
desirable that the solid suspension in gas streams should not 
be distributed throughout the building. This area is provided 
with an unusually large number of exhaust and intake roof ven- 
tilators. 

A gasometer of approximately 600 cu ft capacity has been 
provided for calibration purposes as well as teaching. A closed- 
loop wind tunnel with an approximate 24 X 32-in. throat section, 
likewise required for teaching purposes, is included. The return 
and entering ducts of the tunnel are vertical and the closing 
horizontal leg is on the roof. This arrangement is for the pur- 
pose of conserving floor space. 

A large, low-capacity meteorological and smoke tunnel where 
air velocities in the order of 5 mph can be maintained in a throat 
section of 6 X 8 ft is planned to be suspended from the rigid frame 
of the Annex. Access to the throat section is gained from a bal- 
cony which is reached from the second-floor level. This facility 
will extend the important studies that have been in progress for 
several years on the distribution of air contaminates due to com- 
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bustion and chemical processes. It further will enable the ex- 
tension of studies carried on by the meteorological groups in pollen 
distribution and other research in which they are actively con- 
cerned. 

The entire roof section of the main laboratory building is ar- 
ranged with concrete pedestals which include anchor bolts, the 
top of which is approximately 8 in. higher than the roof deck and 
spaced in rectangular array on 8 X 12-ft centers. A roof grating 
capable of a 75 psf loading is installed on top of the roof pedestals 
in order that outside laboratory activity may be pursued. This 
space is particularly attractive for solar-radiation experimental 
apparatus, many activities in meteorology, and provides access 
and space around the cooling towers for teaching and research. 

The laboratory Annex is designed to provide unobstructed space 
where wave tanks may be installed. Truck access to this space 
is provided in order that the large volume of materials required 
for wave-tank studies may be moved readily. Two 80 x 100-ft 
4-ft-deep tanks may be provided by installing sections of tank 
wall. Either or both tanks may be removed when the space is 
needed for other activities. These tanks and the accompanying 
height of ceiling are the necessary facilities to continue the studies 
of beach erosion, forces on underwater structures by wave 
action, ship-docking studies, and breakwater research studies 
that have been so necessary to the special lakes and lake shore 
problems surrounding the State of Michigan. This facility is cer- 
tain to take on increased significance with the coming impact of 
the St. Lawrence Seaway. 

The major number of installations in the Fluid Engineering 
Laboratory are readily movable. No hold-down bolts or machine 
bases are fixed in the floor. Vibration isolation pads are used 
throughout. In those cases where the normal center of gravity is 
higher than considered safe, a concrete base for the equipment is 
designed and it is set on vibration isolation pads. 

Although a plan to provide a space for each graduate student 
has never been achieved in the Engineering College, such is the 
plan for the Fluids Engineering Laboratory. In order to achieve 
a maximum of flexibility and still house instrumentation and 
provide desk space for the student, a portable type of enclosure 
which can be set up in the open laboratory immediately adjacent 
to the student’s research equipment is being studied. Several of 
such enclosures are installed. At the completion of a project, the 
enclosure may be relocated for a second occupant where he may, 
likewise, enjoy a degree of privacy and close communication to his 
research work. 


Conclusion 


Space flexibility to provide for a rapidly changing technology 
has been the key consideration in the development of the Fluids 
Engineering Laboratory. The laboratory is not confined to a 
particular profession or a particular fluid. The new Department 
of Nuclear Engineering has many interests that revolve around 
fluid behavior. The long-standing profession of Civil Engineering 
and its store of knowledge on fluid flow can be big contributors to 
the special problems attendant to the atomic age. Magnetic 
radiation, magneto-hydrodynamics, and many other advanced 
topics are concerned with gaseous behavior. The continuous re- 
ductions in order from the macroscopic to the microscopic tend 
toward a better understanding of fluid phenomena. It is reasona- 
ble to expect that a better understanding of the continuous 
medium may result from work being done in these areas of ad- 
vanced study. 

It is the hope of the Faculty Planning Committee that the new 
laboratory will provide space and utilities in order that re- 
search, understanding, and the teaching of fluid behavior will con- 
tinue. The established knowledge is a necessary part of new 
technological areas where advanced study must continue. 
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Flow Parameters in Hydrostatic Lubrication 
for Several Bearing Shapes 


In the case of hydrostatic lubrication the designer of thrust bearings has to make 
decisions regarding the shape of pads, location of oil holes, and configuration of oil 
grooves. In this paper several pad shapes and associated oil inlets were investigated 
using conformal mapping techniques to obtain the total load-carrying capacity, flow rate 
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of oil, oil-film thickness, pressure and velocity distribution. 


The results of these calculations permit the designer to approximate his chosen con- 


Dciccitase LUBRICATION is the maintenance of an 
oi! film between two stationary surfaces by pressure produced by 
an external pump. It is different from hydrodynamic lubrication 
where the oil film is produced by the relative motion of two sur- 
faces. 

Hydrostatic lubrication gives considerable reduction of friction 
and wear, and is of practical importance in applications when the 
motion is too slow to produce hydrodynamic lubrication, as in 
the starting of heavy rotating machinery such as turbines, motor- 
generators, and synchrocondensers. Hydrostatic lubrication is 
used also in reversal of reciprocating mction, as in pistons, cross- 
heads, and the like. Elementary theory permits computation of 
the total load-carrying capacity, the flow rate of oil, and pressure 
and velocity distribution for the so-called hydrostatic step-bear- 
ing, which consists of two circular disks, the oil being supplied 
through a central hole. Aside from this case, the elementary 
considerations no longer suffice. Recourse must be had to study 
and solution of the Navier-Stokes equations in suitable approxi- 
mation. As will be made clear, it was possible to reduce the 
general case to determining a conformal mapping. 

The object of the present paper is to compute the flow parame- 
ters for several new cases in order to permit the designer to better 
approximate his own configuration by solved cases. 

The designs investigated are: 


1 Circular plates with central hole 
2 Square plates with central hole 
3. Elliptical plates with oil groove 
4 Infinite strip with central hole 


General Considerations 


The problem to be investigated is the steady flow between fixed 
parallel planes. The following assumptions are made: (a) That 
the velocity parallel to the boundary planes is small, (6) that the 
separation between the planes is so small that the velocity com- 
ponent in this direction can be neglected, (c) that the pressure is 
known at inlets and outlets. With these and other (listed later) 
assumptions, the Navier-Stokes equations can be reduced by 


! The authors wish to express their appreciation to Mr. T. C. Peng 
for assistance in carrying out and verifying the computations for the 
examples in this paper. 
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figuration by computed models in order to estimate the flow parameters. 


neglecting: (d) the time dependent and inertial terms,?(e) the body 
forces, and (f) compressibility terms to: 


1 

Vu = — 4 (1) 
or 
1 Op 

= — (2) 
oy 
1 

Vw = (3) 
pp Oz 


where 
u, v, and w are velocity components in the 2, y, and z-directions 

u is the (coefficient of) viscosity, and p is the pressure 

2b is the oil-film thickness 

If the z-axis is perpendicular to the boundary planes, then w = 
0. 

It follows from Equation (3) that under this condition p is a 
function of z and y only. 

The general continuity equation 

ou ov ow 


+ = 0 
oy oz 


reduces to 


= 0 (4) 


Equation (4) determines that a “stream function’ ¥(z, y, z) 
exists so that 


= -- ( 5 
u ) 
oy 
= 6 
or (6) 


From this one can infer [1, 2]* that the pressure p(2, y) is a har- 
monic function. Using the fact that the solution is unique and 
that u and v vanish at the boundary planes, it can further be 
shown that 


1 1 Op og 
= — . —s 
u (z b*) (7) 


? This assumption may not be accurate near the immediate neigh- 
borhood of inlet hole, where the inertia effects may predominate over 
viscous effects. 

3 Numbers in brackets designate References at end of paper. 
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1 
The function (2, y, z) = (z* — b*)p(z, y) evidently serves 


as a velocity potential for u and v in each layer z = constant. 
Also, in each layer, lines of constant pressure are lines of con- 
stant ‘‘velocity potential’ and vice versa. Determination of the 
pressure therefore determines the whole flow for this class of prob- 
lems. The problem of computing the flow parameters is thus re- 
duced to finding a potential (or harmonic) function p(z, y) satis- 
fying the boundary conditions on pressure, a problem mathe- 
matically identical to problems encountered in electrostatic 
fields, steady heat conduction, diffusion studies, and so forth. 


Problem 1—Circular Plates 


Given a bearing, Fig. 1, consisting of two circular plates with 
central holes. The pressure in the hole is p,; and is zero at the 
outer boundaries. 

Determine: The pressure distribution p 

The total load-carrying capacity P 
The flow rate Qrequired to maintain the film thick- 
ness h 


The problem is solved by the conformal mapping of fluid flow 
parallel to the @-axis in the w-plane into a flow in the z-plane 
(where z = x + iy) by means of the mapping function 

w= —minz = —mIinpe”® (9) 
It follows that 


—m(In p + 26) 


o = 


Lines of constant @ correspond to lines of constant pressure, 


—m In p 


p 
—minp = 
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The load-carrying capacity is 


wr? 
p= f prdrd@ = ——‘*~ p,[l + po? (In py? — 1)] 
0 ré In po? 


(13) 
and for 1, 


1 
r 
2 In 2 In 


where A is the area of the circular disk. 


P=- 


The flow rate is 


2 +b 
ff, v, r dz 


for any r; here v, is the velocity component in the r-direction 


i op 


1 
“le — — 
or 2 wu Or 


(15) 


Upon integration 


where h = 2b = oil-film thickness. 

These results (10), (14), (16) can be obtained directly by ele- 
mentary methods and are well known. They were computed to 
show the simplest application of the method developed. 


Problem 2—Square Plates 


Given a bearing, Fig. 2, consisting of two square disks with 
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central holes. The pressure in the hole is p,; and is zero at the 
outer boundaries. 


Equation (20) takes the form 


Determine: The pressure distribution p i In p* + 0.5415674 + 0.5933672 p* cos 40 
The total load-carrying capacity P 
The flow rate Q required to maintain the film thick- + 0.0518 p* cos 89 (25) 

ness h Equation (25) yields among other things the lines of constant 
pressure in polar co-ordinates p and 6. By the choice of w, 
the pressure was made zero at the outer boundary of the square at 
five points along each boundary line, viz., @ = 45 deg, 22.5 deg, 
0, —22.5 deg, and —45 deg. By choosing more terms in the ex- 
pression for w, the pressure can be made to vanish at more points 
on the boundary. The deviation of the pressure from the zero- 
value along the outer boundary was calculated along the side of 
w, = —minz = 2clnz (17) the square for the following points, Fig. 3. 


The problem is solved by superposition of a source flow (Prob- 
lem 1) for an infinite plate, and a secondary flow chosen such that 
the resulting pressure along the outer boundary of the square 
vanishes. We shall, for simplicity, however, merely determine 
the latter in a close approximation. 

The mapping function for a source flow is 


0.1 0.2 0.3 0.4 0.5 


—().0000004  —0.0001794 —0.0005928 —0.001025 —) 0007445 —0_ 0000004 


0.6 0.8 0.9 


—0.0000004 +0.0011438 +0.0022155 +0,0025856 +0.0018168 +0.000002 


The mapping function (approximate) for the secondary flow is 
chosen to have fourfold symmetry 


= do’ + + (18) 


where do’, a;’, as’ are to be determined so that the pressure van- 
ishes at five chosen points at the outer boundary. 
The resulting mapping function is 


| 
w = 2c’ Inz + ao’ + + = + ip (19) 


The real part of w, ie., ¢, is the velocity potential, and the 
lines of @ = constant correspond to lines of constant pressure in Fig. 3 
the z-plane. 


Re(w) = In p? + a» + asp* cos 46 + agp* cos 80 = e’ (20) 


Po 
The maximum deviation from zero ~— = 0.0025856 can be cx- 


pressed in terms of p, for a given a = 10 


At the outer boundary 


re a? 5 r3 (22) This value can be calculated from Equation (25). 
~ eos? @ cos? j 
Introducing a new parameter 8 = Equation (25) takes the 
a 


- andp =p, = 0 form 


Po _ cos 46 cos 80 p 
cost) cost 6 + ™ = ln + 0.542 + 0.1488 cos 40 + 0.003248" cos 88 (26) 


Imposing the conditions Lines of constant pressure were computed by choosing 6 constant 


r 
6=0 Py = 0 ) and computing p as a function of 8 = —, Fig. 4. 
| a 
0 = 22.5 deg Pp, = 0 (24) The stream lines (constant value of the stream function) were 
6 = 45 deg p, = 0 computed by setting the value of the imaginary part of the com- 


plex potential w equal to a constant; Im(w) = constant = c” 
permits the determination of do, as, and as by solving three simul- 
taneous equations obtained by introducing condition (24) into v = 3 (0 = ) + a,8* sin 40 + a6* sin 80 = (27) 
the equation (23), whence c 4 


do = 0.5415674 a, = 0.5933672 a = 0.0518 Solve for 
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P= 
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| 
4 
(21) 
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(28) 


‘ —a, sin 40 + (0: sin? 40 — 4asg sin 80 [- +2 (0 )}) 
= c 4 


a 2a, sin 80 


The flow rate is 
The values of 7 as a function of 6 for constant values of —> are 


‘ Qe +b 
plotted in Fig. 4. Q= f, v,7 dz d6 


POTENTIAL LINES 


= CONSTANT where 


STREAM LINES = CONSTANT 


Abe 


For r;<a 
&-008 
\ Bo? 
£-015 p=c'ln ( + age’ 
E-o2s \ 
nd 
\ = 
7025 
¢ In +a) 


Substituting (34) into (15) yields 


us 


i 02 03 04 05 06 | Problem 3 
aa a ; a Given a bearing, Fig. 5, consisting of two elliptic plates with an 
Fig. 4 oil groove connecting the two focuses of the ellipse. The pressure 
- in the groove is p,; and the pressure at the outer boundary is zero 
The load-carrying capacity is Determine: The pressure distribution p 
La a The total load-carrying capacity P 
4 cos 6 > - rate quire aintai ick- 
P=8 f f pr dr d0 (29) The flow rate Q required to maintain the film thick 
0 ri ness h 


where The problem is solved by conformal mapping of the source flow 


B2 (Problem 1) in the w-plane by means of the mapping function 


= In a,84 cos 40 + cos 86 
c 2 


c’ can be evaluated by setting r = r;< a for the inner boundary of 


the plate where the pressure p is equal to the inlet pressure p;. G7 
Ps (30) | 


* In Bo? — 0.15158 


Integrating the Equation (29) results in 


8a? 
Pa 
In By? — 0.15158 


where 


For 1 


3.7012 a? 0.925 


In ( ; ) — 0.15158 2 In ( :) — 0.54157 
a’ r; 
(33) 
' where A is the area of the square plate. Fig. 5 
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(36) 


This function transforms the inside of the p, circle of the w-plane 
into the inside of the ellipse in the z-plane. The circle py is 
mapped into the focal slit. 

Substituting w = p™ into the Equation (36) and setting z = 
z + iy one obtains 


4 (+ 
For p = pp», the ellipse degenerates into a slit f< f,n = 0 


If p > po, the ellipse approaches a circle. 
For example, if 


= 10, 


semimajor axis = 5,05 f 
semiminor axis = 4.95 f 


The lines of constant p correspond to lines of constant pressure 
in both the w and the z-planes, Fig. 6. The lines of constant pres- 
sure are ellipses given by the Equation (37). Introducing the 
source flow, 


(38) 


where 


at p = po, p = p;,in the slit; and at p = p,, p = 0 at the outer 
boundary. 
The major axis of the ellipse is 


R, Ro 
~ 9 (2: 
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£ = CONSTANT 


POTENTIAL LINES 


STREAM LINES 6=CONSTANT 


The minor axis of the ellipse is 


- 
2 \R R, 


It follows from (40) and (41) that 


d= 


c? — d? =f? 


or that the ellipses are confocal. 
The value of pp can be computed 


Ro c-—d 
For c and d much larger than f 


where D is equal to the diameter of the circle. 
The load-carrying capacity is 


P=SSpdidgn= fi Spdrdy 
integrated over the ellipse. 


Using the Jacobian transformation of the area dz dy in the 
z-plane into the area p dp d@ in the w-plane results in 


1 2s | dz 2 
P -{ p f?|\—  pdpdé 
0 | dw 
po 
| dz |? 


be computed from (36) 
w 


(44) 


(45) 


(47) 


The area of the ellipse A can be introduced, and the relations 
(42) and (43) substituted into (47) result in 
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Changing the co-ordinates to 
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te 2 41) 

c+d 
then 
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= In p 

R, 
R, 

-= 39) 
p R, ( 

Substituting (46) and (38) into (45) and integrating, one obtains 

af? 1 
4 In pp? P (+ 


or for c, d >} 


oil groove 


+b +1 

—b —1 


In order to compute > 


oy 
xz = F(p, 0) 
y = Gp, 8) 


oy 


op 
G,— G 
e dy + 6 
ra) 
Solve for 
oy 


Op 


Substituting in (53) for 


F=- 


gives 


Ov p=po 
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Differentiate (52) with respect to y 


fa) 
+ Fe 


oy F,Ge — FeG, 


fe) 
The differential = can be computed 
y 


The flow rate can be computed considering the outflow from the 


1 


1 
Oy 


notice that 


fl op op Op 
oy op oy 
From (38) follows 
(51) 
oy In po Oy p 


— can be computed by considering 


F(p,@) =0 


(52) 
—y =0 


00 
— 
oy 


06 
——-1=0 
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— Fe (53) 


sin 6 


(55) 


(49) 


Substituting (55) into (50) results in 
he 


il 
In| —— } 


(56) 


f 
or for c,d >f 


A 
Problem 4 


Given a bearing, Fig. 7, consisting of a narrow infinite strip 
with a central hole, where the pressure is p,;, reducing to zero at the 
outer bounu.:y. 


Determine: The pressure distribution p 
The total load-carrying capacity P 
The flow rate Q required to maintain the film thick- 
ness h 


The problem is solved by conformal mapping of the source flow 
(Problem 1) in the w-plane by means of the mapping function 


w+l 


= 


(58) 
This transforms the inside of the unit circle 1, 2, 3, 4 in the w-plane 
into the inside of the infinite strip 1, 2, 3, 4 in the z-plane. 

Substituting w = pe* into Equation (58) and setting z = z + 
iy, there follows 


1 | (p? + 2p cos 8 + 1)? 


2 = p* — 2p* cos 20 + 1 


A small inlet hole of radius pp< 1 corresponds to a circular hole 
in the z-plane 


Z PLANE W PLANE 


= ©) 


Fig. 7 
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Changing the co-ordinates to 


there follows 


1 p? 
T 2p sin 0 


(p? + 2p cos 6 


= —In 
T p*—2pcos9+1 | 


The lines of constant p correspond to lines of constant pressure 
in the w-plane; they are also constant-pressure lines in the z- 
plane, Fig. 8. The lines of constant @ correspond to streamlines, 
Fig. 8. 

In the w-plane 


(61) 


For po< 1 in the z-plane, there corresponds a circle with the radius 
r, (inlet radius). 
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By elimination of pp from (62) and (61) there follows 
p=cinp 


where 


Total load-carrying capacity 


P= f f p didn = — f f p dz dy 


Using the Jacobian transformation of the area dz dy in the 
z-plane ito the area p dp d@ in the w-plane yields 


4a? 1 | dz |2 
P= f pdp dé (65) 
Jo | dw | 
dz * 


~—| can be computed from (58) 

| dz |? 4 
| (66) 
| dw p* — 2p? cos 20 + 1 


Substituting (66), (63), (62) inte (65) and integrating, one obtains 


16a? 
Ps — Ps - + — In (67) 
In po" 8 


1 
Pr = AMI — In pe?) = 0.05605, 


and for 
ra’ 
a < 1, = 
da 
Tr, 
The load-carrying capacity of the infinite strip is approxi- 
mately twice that of a circular disk whose diameter equals the 


width of the strip. 
The flow rate is 


+b 2" 12 
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For small r 
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Substituting (70) into (69) and integrating results in 


hs 


in(4 


Q = 6 re ( )” 
In 


Summary of Results 


Circular bearing, Fig. 1 
1 
21n in 


Square bearing, Fig. 2 


0.925 hs 
P =- 
2 In — 0.54157 


Elliptic bearing, Fig. 5 
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2 In In — 
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Infinite strip bearing, Fig. 8 
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DISCUSSION 
Richard Elwell‘ 


The authors have presented a unique method of analyzing 
hydrostatic thrust bearings. Usefulness of the technique ap- 
pears to be unlimited, and its presentation comes at a time when 
there is a great deal of interest in this type of bearing. Up to 
now, bearings of such odd shapes as ellipses have been solved al- 
most exclusively on electric analog field plotters. 

In the derivation of Equation (13) in this paper, the term 
wp,r? has been omitted. This represents the portion of the 
load carried by the pocket of the simple circular bearing. The 
equation should be: 


— ar 
0 


A derivation of this may be found in [3].5 Equation (71) will re- 
duce to (14) for the special case in which po? << 1. 

To illustrate the error introduced, consider the following bear- 
ing, tested in the course of a project which will be the subject ofa 
forthcoming paper [4] on hydrostatic bearings: 

‘ Bearing and Lubricant Center, General Engineering Laboratory, 
General Electric Co., Schenectady, N. Y. Assoc. Mem. ASME. 

’ Numbers in brackets designate References at end of discussion. 
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p; = 120 psig 
; = 0.375 in. 
= 0.625 in. 


Measured load capacity = 87.3 lb 

Load capacity calculated by Equation (71) = 92.2 lb 

Load capacity calculated by Equation (13) = 39.1 lb 
The term 7 p,r;? for this case amounts to 53.1 |b, which is a major 
portion of the bearing load capacity. 

The authors’ solutions for the square plate bearing with central 
round hole may be compared to a previous solution of this shape 
by A. M. Loeb [5]. This geometry, and many others, were 
solved by an electric analog field plotter. The load and flow 
from [5] for a square plate 20 in. on a side, with a one inch di- 
ameter center hole, is: 


P = 60.1 p; 
py h® 


Q = 0.129 


The solutions for these quantities from this paper are: 


P = 53.7 p; (from Equation 31) 


Q = 0.171 a. (from Equation 35) 


It is seen that load capacity agrees fairly well, but there is a 
marked divergence in the two flow predictions, with the present 
paper predicting a full 32 per cent more flow than the electric 
analog solution. 
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Authors’ Closure 


Mr. Elwell’s discussion and indication of the excellent electric 
analog work of Mr. Loeb is greatly appreciated since it affords an 
opportunity to clarify an important aspect of our paper. 

Our illustrative examples all refer to cases wherein the internal 
hole is of very small radius so that the contribution of the oil 
within it to the load capacity is negligible. As may be noted, in 
no example has the “pressure-times-area’’ on the internal hole 
contribution to the load capacity been added in. Perhaps the 
basic source of confusion is the fact that the mapping for the first 
example (when one adds in the pressure contribution of the 
hole) holds good regardless of the size of the internal hole. 

When the hole is of finite size, the authors’ method is still 
valid, but more complicated conformal mappings may be 
required. E_.g., for the case of the square plate with circular hole 


” 


the approximate ‘“‘secondary flow’? would be determined by re- 
quiring its contribution to the pressure to vanish at certain points 
on the boundary of the hole as well as to make the fotal pressure 
zero at certain points on the boundary of the square. 

In any case, error in the pressure at any point introduced by 
the approximate mapping can always be easily estimated be- 
‘ause the maximum error must occur on the boundary, i.e., either 
on the internal hole or the boundary of the pad. Thus also sys- 
tematic development of this type of approximate mapping can 
lead to a computer method for determining the flow for arbitrary 
bearing shapes, the main elements of computation involved 
being solution of a system of simultaneous linear equations. 
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A General Method for Correlating 


F. E. HEFFNER 


Senior Research Engineer, 

Research Laboratories, 

General Motors Corporation, 
Warren, Mich. Assoc. Mem. ASME 


Labyrinth-Seal Leak-Rate Data 


The proper selection of an optimum labyrinth seal for a given application depends 
upon an accurate estimate of the seal leak rate. Suitable accuracy, particularly for 


straight labyrinth seals, can only be obtained from actual seal tests. Testing a com- 
plete series of seals is time-consuming and expensive. A method for correlating test 
data is presented which allows calculation of the leak rates for an entire family of 
labyrinth seals on the basis of tests of only two characteristic seals. Leak rates predicted 
by the method are within 3 per cent of the basic test data. 


se of the optimum labyrinth seal for any 
given application is highly dependent upon the ability of the seal 
designer to predict leak rates through the various possible seal de- 
signs. Theoretical leakage predictions are not usually accurate 
enough for careful design decisions, and it has become neces- 
sary to modify the theoretical estimates by empirical factors 
obtained from actual seal tests. Furthermore, maximum useful- 
ness is obtained from these tests only when general relationships 
between seal leak rates and physical seal-design parameters can 
be expressed. 

General methods of estimating the leakage of “‘staggered’’ 
labyrinths are available in the literature [1, 2, 3, 4],' while data 
upon which accurate estimates of “‘straight’’ labyrinth leakage 
can be based are less extensive and are more closely limited to 
exact seal configurations which have been tested [3, 7, 8]. Like- 
wise, correlations between empirical data and physical seal- 
design parameters are more widely applicable for staggered seals 
than they are for straight seals. 

An experimental study of the flow characteristics of straight iab- 
yrinth seals of small radial clearance was made with the intention 
of extending available design data into the small-clearance range 
by means of one of the existing correlation methods. Analysis of 
the test data, however, showed that known general relationships 
for other absolute dimensional ranges were not applicable with 
radial clearances smaller than 0.010 in., and it was necessary to 
evolve a new system for expressing relationships between seal 
designs and leak rates. 


! Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of Tae American Society 
or MecHANicaL ENGINEERS and presented at the joint ASLE- 
ASME Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Norte: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, July 6, 
1959. Paper No. 59—Lub-7. 


Nomenclature 


The method of correlation which was developed for the small- 
clearance range, however, may be applied to labyrinth seals of any 
design. It allows prediction of the leak rates of a whole family 
of labyrinth seals on the basis of data which may be obtained 
from two tests on components typical of the seal family of in- 
terest. In certain instances one or both of these tests may be 
eliminated and in their stead theoretical relationships or existing 
data may be used. The accuracy of the leak rates predicted by 
the method given will be confirmed within 3 per cent by test. 
The primary purpose of this paper is to discuss the method of 
correlation. In doing this it is also necessary to present some test 
data and specific curves based upon these data. It must be 
emphasized that the correlation method is applicable to either 
straight or staggered labyrinth seals with blades of any size, shape, 
or spacing. However, the test data used to demonstrate correla- 
tion method must be applied only to straight labyrinth seals 
within the explicit dimensional limits for which they are given. 


Leak-Rate Analysis 


Analysis of the leak rates through labyrinth seals is most easily 
approached by reasoning analogous to that which describes the 
flow through single orifices. The basic equation for the leak rate 
through the seal may then have the form of Equation (1) which 
can be made identical to the familiar orifice equation if the 
proper pressure ratio function is introduced. 


= 
M = aAgl (1) 

Effective use of this equation to express the leak rate through 
a labyrinth seal depends upon the proper expression of the co- 
efficient @ and the proper definition of the seal-pressure-ratio 
function @. 

In the present analysis, @ is defined as the contraction co- 
efficient of the blades in the seal, and it is determined by experi- 


gas-flow area, sq in. 

clearance space, ID and OD, 
respectively, in. 

mass leak-rate velocity, lby/ 
(sec )(in.?) 

gravitational constant, (lby) 
(in.)/(Ibp (sec?) 

radial clearance, in. 

gas specific-heat ratio (dimen- 
sionless ) 
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mass leak rate, by per sec 

number of blades 

Reynolds number (dimension- 
less), Nre = 2Gh/p 

absolute pressure at seal inlet, 
Iby per sq in. 

absolute pressure at seal out- 
let, lbp per sq in. 

gas constant, (Ibp)(in.)/(Iby) 
(deg R) 


pressure ratio = P,/P 


critical pressure ratio 

absolute temperature at seal 
inlet, deg R 

blade thickness, in. 

specific leak rate, (Ibu )(deg 
W= G(T") 
/P 

contraction coefficient 

pressure-ratio function 


viscosity, Iby /(in. )(see) 
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A= M = for = 

Dy, dD, = N T = 

Nre = 

G = p = 
W = | 

a 
h = R on a= 

| = 


mental measurements of leak rates through a single-blade seal. 
Numerical values of contraction coefficients are calculated 
from single-blade test data where 


2k 1 ( 
g 
M = aA = k ‘ 


Correlation between single-blade test data and the blades in a 
multiple-blade seal is based upon Reynolds number which may be 
expressed in terms of a specific leak rate as 


= a p(T)” Qa ( 


The seals which are the subject of the present study are uni- 
formly constructed with the blade dimensions, radial clearances, 
and blade spacings invariant throughout a given seal. Thus the 
mass velocity through the seal is constant, the clearance is the 
same, for all of the blades, and the viscosity of the air as it passes 
through the seal is nearly constant, so that Reynolds number for 
a single blade in the seal is the same as the Reynolds number of 
the entire seal. 

The pressure-ratio function used in the present analysis is 
that derived by Egli [3] 


(4) 


Egli’s derivation contains several assumptions and approxima- 
tions, only two of which ever differ enough from actual conditions 
in a straight labyrinth seal to cause variation between the 


theoretical and experimental conditions. First, Egli postulated 
that the velocity from the expansion at one blade must be dissi- 
pated in the interblade space, so that the velocity of approach at 
the succeeding blade is very small. Second, in order to perform 
some mathematical operations, Egli assumed that the seal would 
have a “‘large number” of blades. There is no direct theoretical 


way to deduce how many blades this ‘‘large number’’ has to be, 
but it is to be expected that the derived pressure-ratio function 
will represent flow conditions accurately only in seals composed 
number of blades. 

With the parameters thus defined, it is convenient to use the 
specific leak to eliminate complexities caused by varying initial 
conditions and to rewrite the basic equation as 


of some minimum 


(5) 


Labyrinth-Seal Critical Flow 


The similarity between a labyrinth seal and a single orifice in- 
cludes a critical flow condition of a maximum gas velocity which 
is uninfluenced by further reductions in downstream pressure. 

In a labyrinth seal the static pressure decreases from blade to 
blade as the gas expands through the successive stages. Since 
the mass flow is constant throughout the seal, and since the 
stagnation temperature also remains essentially constant through- 
out the seal; it can be shown that the maximum gas velocity will 
occur at the blade farthest downstream. The seal flow becomes 
critical when this maximum velocity reaches the sonic velocity for 
the flowing gas. Thus, an ideal labyrinth will reach its critical 
over-all pressure ratio when the static pressure ratio at the last 
blade reaches the known critical value for the particular gas being 
used. 


This critical flow condition of the seal may be theoretically de- 
fined by the seal-pressure-ratio function, Equation (4) which be- 
comes maximum at a certain “eritical’’ over-all pressure for the 
respective number of blades. 


A comparison between this theo- 


retical criterion and experimentally determined critical pressure 
ratios is shown in Fig. 1. The solid line is the locus of the 
calculated critical pressure ratios, and the individual points show 
the experimentally determined over-all pressure ratio at which 
the static pressure ratio across the last blade of each seal became 
equal to the critical value for air. 
spacings are shown. 

There is some variation between theory and experiment, but 
the data points are generally within a 0.05 pressure ratio of the 
calculated curve. The significance of this much variation is 
best assessed by a consideration of Fig. 2 where complete curves 
of pressure-ratio functions for several numbers of blades are 
plotted. It can be seen that the value of the function changes 
very little for over-all pressure-ratio changes of 0.05 near the 
maxima. Thus the condition of seal criticality can be indicated 
by the occurrence of the maxima of the respective pressure-ratio 
functions. 


Data for seals of four blade 


The labyrinth-seal pressure-ratio function is thus applied in 
the same manner as the single-orifice pressure-ratio function by 
postulating that it retains its maximum value for all pressure 
ratios below that at which the maxima occur. The diagonal 
dashed line in Fig. 2 shows the maximum values of @ for all num- 
bers of blades. It is easily shown that @max is equal to re(k y'/2, 


Pressure Ratio at Max @ — Calculated 
© Blade Thickness .0055 inch 
o Blade Spacing 
= 9 ® 029 inch 
« c 0% 
x 88 “te 
& 
.00 
0 5 10 15 2 2 
NUMBER OF BLADES 
Fig. 1 Theoretical and experimental critical pressure ratios, radial clear- 


ance = 0.008 inch 


40 


é NUMBER OF BLADES 
5 


PRESSURE RATIO FUNCTION 
83 


. 
max 


OVERALL PRESSURE RATIO 
Fig. 2 Labyrinth-seal pressure-ratio functions 
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Experimental Method 


The leak-rate data referred to in the subsequent sections of the 
paper were obtained by a simple and direct test method. Com- 
pressed air was passed in series through a calibrated orifice flow- 
meter and the test seal, and the leak rate determined from the 
orifice-meter readings in the customary manner. 

The tests were limited to uniform, straight labyrinth seals of 
radial clearances less than 0.011 in. Only rectangular geom- 
tries were studied. In particular, the upstream and downstream 
sides of the blades were parallel and normal to the direction of the 
gas flow; and the steel blade edges were ground to maintain 

The 
Fig. 3 
is an enlarged scale drawing of the arrangement of the blades and 
housing for a typical test combination. Significant geometrical 


corners as sharp and free from imperfections as possible. 
data presented in this paper are for stationary seals only. 


nomenclature is shown in the sketch. 

Test seals were constructed by assembling suitably sized 
blades and spacers on a hub to form the bladed seal section. 
These blades, spacers, and the hub are shown in Fig. 4. This 
assembly was then inserted into a cylindrical housing to form the 
labyrinth seal 

It is evident that this method of seal construction, while suita- 
bly versatile, had the limitation that allowances and tolerances 
of the component parts would tend to change the seal dimensions 
slightly. In order to evaluate the effects of these manufacturing 
variations, seal build-ups and leak tests were repeated periodi- 
cally during the program to check the reproducibility of the data. 
In the seal tests thus checked, all of the test points were within 
three per cent of a mean curve through the data. On the basis of 
these duplicate tests, the accuracy of the test data which are pre- 
sented in this paper has been established at +3 per cent. For the 
purposes of these labyrinth-seal tests, the calibrated orifice 
meters which measured the leak rates can be assumed to have 


had no error. 


Construction of Correlation Curves 


Two sets of data are required to construct the basic curves from 
which leak rates through a whole family of seals may be calcu- 
lated. 

First, it is necessary to obtain numerical values of the contrac- 
tion coefficients for the blades of interest from leak-rate tests on 
a single blade. These may be plotted against Reynolds number 
as shown by the solid line in Fig. 5. 

Second, it is necessary to determine experimentally the leak 
rate, through a multiple-blade seal of the blade size and spacing 
of interest. The method of constructing the second correlation 
curve from these test data is outlined in Table 1. 

The data in columns 1, 2, 3, and 4 of the table may be deter- 
mined directly from multiblade test data. The contraction co- 
efficients in column 5 are taken from the single-blade test results 
shown as the solid line in Fig. 5. Column 6 may then be calcu- 
lated. Equation (5) suggests plotting values of W/a as ordinates 
against the respective values of @ in column 3. These data from 
Table 1 are shown by circular symbols in Fig. 6. 

The experimental data show a nearly linear relationship be- 
tween variables below the critical pressure ratio; but W/a 
becomes multivalued, and hence indefinite, when the pressure 
ratio becomes smaller than the critical ratio. The curves of Figs. 
5 and 6 therefore define the leak rate of the seal in the manner 
suggested by Equation (5) only for pressure ratios above the 
critical. 

The following expedient was employed to resolve this am- 
biguity at less-than-critical pressure ratios. The curve estab- 
lished for the lower values of the pressure-ratio function was ex- 
tended as shown by the dashed portion in Fig. 6 so that ordinates 
corresponding to the abscissas of the last four data points are in- 
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Fig. 4 Hub, blades, and spacers 
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Fig. 5 Contraction-coefficient-curve example 


It will be seen that the 
last three points which have a common value of the pressure-ratio 
function now also have a common value of W/a. The ordinates 
indicated by these crosses were then recorded in Table 1, column 
7, opposite their respective pressure-ratio functions; and new 
values of the contraction coefficients were computed from columns 
7 and 2. The new contraction coefficients are given in column 8, 
and they have then been plotted in Fig. 5 with their smoothed 
values indicated by the dashed line. 


dicated by the two crosses on the curve. 
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Table 1 


Labyrinth-seal test data 


Blade thickness 0.0055 in. 
Radial clearance 0.008 in 
Number of blades 10 
Blade spacing 0.108 in 
1 2 3 4 5 6 7 8 
Specific Single 
Pressure leak Reynolds blade ( ~) 
Run ratio rate, W o number a a a 
1 0.833 0.127 0.173 1473. 0.185 
2 0.711 0.166 0.217 2247 0.707 0.235 
3 0.622 0.188 0.240 2906 0.720 0.261 
4 0.551 0.203 0.253 3552 0.732 0.278 
5 0.497 0.215 0.262 4161 0.745 0.288 
6 0.330 0.240 0.277 6991 0.798 0.301 0.304 0.789 
7 0.247 0.248 0.280 9669 0.853 0.291 0.308 0.802 
fad 0.198 0.251 0.280 12226 0.880 0.285 0.308 0.815 
9 0.165 0.254 0.280 14841 0.893 0.284 0.308 0.825 


| Table 2 Recommended minimum numbers of blades 

Radial clearance, Blade spacing, Minimum number 

in. in. of blades 

0.029 14 
0.0465 14 
0.008 0.108 10 
0.217 7 
0.326 5 


wa 


a 


PRESSURE RATIO FUNCTION 
Fig. 6 Seal-correlati 


ve example 


Also, the slope of the curve in Fig. 6 is numerically greater than 
the theoretical constant of Equation (5) [0.7767 (Iby)(deg R’/*)/ 
(Iby)-(sec)], so it must be concluded that this equation does not 
exactly represent the operation of the labyrinth seal. The more 
general statement of Equation (5a) is therefore introduced with 
the intention that this functional relationship be expressed only 
graphically. 


(5a) 


Application of Correlation Curves 


The specific leak rate through a seal of any number of blades 
operating at any over-all pressure ratio can be calculated directly 
from the equations and curves of the previous sections. Equation 
(5a) as expressed by the curve of Fig. 6 may be used to establish 
the value of W/a when the pressure-ratio function has been 
evaluated from Equation (4). This value of W/a@ can be sub- 
stituted into Equation (3) with the particular values of the other 
operating-condition variables to obtain an equation expressing 
Reynolds number as a function of the contraction coefficient a. 
Another independent relationship between Reynolds number 
and the contraction coefficient is expressed by the curve of Fig. 5. 
The common solution to these two equations establishes the value 
of the contraction coefficient; and the value of the specific leak 
rate can be found when W/a and @ are known. 
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0.029 

0.0465 14 
0.005 0.108 10 

0.217 8 


0.326 


This procedure is demonstrated in detail in the Appendix. 
The example is based upon the data of Figs. 7 and 8 which are 
from actual tests of seals with 0.008-in. radial clearance and 
0.0055-in-thick blades at the various blade spacings shown. 
Since several spacings are considered in each chart, care must be 
exercised to ensure the use of the contraction-coefficient curve 
and leak-rate correlation curve for the spacing of interest. 


Discussion of Correlation Method 


It has been noted that the pressure-ratio function which is the 
basis of the correlation curves can be expected to produce dis- 
crepancies between theory and experiment for seals of too few 
blades or when there is velocity carry-over between the blades. 
The limitations imposed by these two conditions are discussed 
first. 

A demonstration of the blade-number effect is given in Fig. 9. 
Seal-correlation curves are plotted using data from tests on seals 
which have too few blades. The upper family of curves for 2, 3, 6, 
and 10 blades at 0.029-in. spacing shows that the function W/a 
tends to become too small as the number of blades decreases. 
The “seal-correlation curves’’ in this figure are portions of the 
exact curves of Fig. 7. The data of Fig. 7 are for various numbers 
of blades above the allowable minima, and that figure shows that 
the correlation with respect to blade numbers is good for higher 
numbers of blades. 

Further examination of Fig. 9 shows that, as the blade spacing 
increases, the allowable minimum number of blades tends to de- 
crease. It will be seen that at 0.217-in. spacing the data for only 
three blades follows the design curve quite closely until the maxi- 
mum value of the pressure-ratio function is approached. At the 
larger spacings, 0.651 and 1.302 in., data from tests on two-blade 
seals approach the theoretical line of Equation (5) very closely. 

Empirical analyses like those of Fig. 9 led to the recommended 
minimum numbers of blades given in Table 2. It will be seen 
that these minima are functions of both the radial clearance and 
the blade spacing. These data may be extrapolated without loss 
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Fig. 7 Seal-correlation curves, blade thickness = 0.0055 in., radial clearance = 0.008 in. 


of accuracy to the extent that the minima recommended for a 
spacing of 0.326 in. may be applied to seals of larger spacings as 
well. 

The velocity-carry-over effect is indicated by increased slopes 
of the leak-rate curves for the smaller clearances in Fig. 7 where 
the data are from seals with enough blades to eliminate the blade- 
number error. It is to be expected that the velocity effect will be 
greater at small spacing and that it will decrease as more inter- 
blade space is provided. The curves of Fig. 7 show that when the 
spacing is 0.326 in., the leak rate is only about 4 per cent greater 
than theoretical. From this it may be concluded that the line of 
Equation (5) may be used to calculate the leakage of seals with 
blade spacings greater than 0.326 in. 

The trends shown in the contraction-coefficient curves of Fig. 8 
indicate the same conclusion. The apparent contraction co- 
efficients in the multiblade seals approach the actual coefficients 
for the single blade as the blade spacing increases. This suggests 
that the gas in the seal nears the zero approach velocity of the 
single-blade test as the spacing increases. The conclusion that 
a spacing of 0.326 in. is sufficient to allow use of the single-blade 
data for the seal-contraction coefficient also comes directly from 
the method of construction of the correlation curve. For this 
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large spacing the apparent contraction coefficients in column 8 
Table 1, are found to equal the actual single-blade coefficients 
throughout the entire Reynolds-number range. 

This discussion of the restrictions imposed upon the sys- 
tem by the basic differences between the theoretical labyrinth 
seal and the practical labyrinth seal completes the description of 
the correlation method and establishes its limitations except for 
a definite statement as to the accuracy of the leak-rate calcula- 
tions which are based upon the method. If the contraction- 
coefficient curve is plotted for the exact blade geometry of in- 
terest—and this need not necessarily be the rectangular geometry 
of the present data—and if the correlation curve is constructed 
from tests on a seal of the same blade geometry with the blade 
spacing of interest and with more blades than the appropriate 
minimum given in Table 2; the leak rate through a similar seal 
of any number of blades above the stated minimum may be calcu- 
lated to the same accuracy as the experimental data. This ac- 
curacy estimate is entirely empirical, and it is based upon data 
from about 200 test runs at various blade sizes and spacings. 
When the data were plotted in the manner of Fig. 7, the points 
were always within 3 per cent of the mean curve. A 3 per cent 
deviation had been established as the variability of the experi- 
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ments by means previously mentioned. Extrapolation of the 
data to larger numbers of blades than those actually tested is 
allowed because of the theoretical foundation of the pressure- 
ratio function @, the evidence presented in Fig. 1, and studies 
like those shown in comparing Figs. 7 and 9. 

The correlation method may also be extrapolated with respect 
to seal diameter. That is, the method may be used to calculate 
the leak rates through seals of diameters different from those 
which are tested to obtain the data on which the curves are 
based. The seal radial clearance is usually so small with respect 
to the circumference of the seal blade that changing the diameter 
of the seal while maintaining the same blade dimensions and 
spacing will change only the area of the leak path. This area 
change, of course, has no effect on the specific leak rate which is 
the basis of the correlation method. 

In some instances it may be desirable to estimate the leakage 
through a straight-labyrinth-seal design under circumstances 
which do not warrant construction of the test apparatus required 
to obtain the bigh accuracy of the procedure outlined in this re- 
port. In this event, contraction-coefficient data probably can be 
obtained from the literature since a large number of single-blade 
tests have been reported [4, 5, 7, 9]. Fig. 10 has been included 
to supplement these data. Extrapolations of contraction co- 
efficients on the basis of thickness-to-clearance ratios are likely 
to be as much as 15 to 30 per cent in error, however. This is par- 
ticularly true in the small clearance range, as the data of Fig. 10 
indicate. 

The theoretical-correlation curve of Equation (5) may be used 
for seals with sufficiently large blade spacing or for staggered 
labyrinth seals. In the absence of additional experimental data 
from which new correlation curves for particular seals can be 
drawn, the curves of Fig. 7 are the best estimates available for 
small blade spacings. If these curves are used for clearances and 
blade thicknesses other than those for which they are drawn, but 
for clearances less than about 0.010 in. and blades less than 0.020 in. 


270 / suNne 


1960 


REYNOLDS NUMBER 


Contraction-coefficient curves, blade thickness = 0.0055 in., radial clearance = 0.008 in. 


10 12 14 16 18 
- THOUSANDS 


thick, they will probably estimate within 10 per cent of the cor- 
rect leak rate provided that the correct contraction-coefficient 
curve is used. No estimate of the error of extrapolating 
these curves to larger clearances has been made. 


Conclusions 


Theoretical estimates of the leak rates through straight 
labyrinth seals are of very little value in careful seal design be- 
cause the actual seal operating conditions are appreciably dif- 
ferent from those which must be postulated in order to solve the 
theoretical problem. Accurate estimates of leak rates must be 
based upon tests of the exact seal design of interest. 

A method of correlating test data is presented which allows the 
calculation of leak rates through an entire family of seals from 
data which can be obtained from only two tests. If the method is 
properly applied, the accuracy of the estimated leak rates will be 
the same as the accuracy of the test data from the required two 
tests. This may be of the order of 3 per cent as compared to the 
20 to 100 per cent errors which might result from purely theoreti- 
cal calculations. 

The method may be applied to either straight or staggered 
labyrinth seals of any blade shape, size, or spacing; and the re- 
quired two tests need not be on full-size seals. 

The method may be extended to making estimates of greater 
than purely theoretical accuracy in instances where a test pro- 
gram is not warranted. Under these circumstances, the resulting 
estimates will probably not be within the 3 per cent accuracy, but 
the method achieves the maximum of accuracy from whatever 
pertinent experimental data may be available. 
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APPENDIX 


Sample calculation of leak rate from correlation curve and 
contraction coefficient curve. 


1 Establish seal-design and operation conditions: 


Radial clearance 0.008 in. 
Blade spacing 0.217 in. 
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Number of blades 7 
Supply pressure 
Supply temperature 
Over-all pressure ratio 0.5507 

Air viscosity 1.05 10~* Iby /(in. )(see ) 
2 Determine pressure-ratio function from Equation (4). 
Carefully establish whether pressure is greater or less than critical 
value. If ratio is less than critical, use appropriate value of @max 
(see Fig. 2). 


25.26 psia 
528 R 


@ = 0.29787 < critical 
3 Determine value of W/a from seal correlation curve for 
proper spacing (Fig. 7). 
Ww 


a 


= 0.2520 


4 Solve for numerical factor of Equation (3) using W/a from 
step 3 and proper values of variables in general term. 
2)( 25.26 (0.008) 
Naw = (0.2800) 
(1.05 
Nre 4222 a 


5 Sketch the straiglit line representing the equation of step 4 


1960 / 271 


56 
52 
48 | 
4 Z 
| 
029 Spacing / ; 
2 
Four Curves 
36 
kz 
651 and |. Spacin 
Va 651 and 1.302 Spacing 
/ 
x | = 
A - 
A 
24 
| Z | 
° 
5 
16 
P 
| 
# 


010 


.003 Radial Clearance e 020 


Blade Thickness 


© .005 inch oo | 


008 Radial Clearance 


CONTRACTION COEFFICIENT 


.005 Radial Clearance 


16 


on the contraction-coefficient chart, Fig. 8. Typical lines piotted 
in Fig. 8 may be used as guides. The line required from step 4 is 
slightly to the right of the Na. = 4000a-line shown. 

6 The intersection of the line of step 5 and the contraction- 
coefficient curve for the proper blade spacing is the common solu- 
tion for a@ which is required. 


also 


Nre = 3010 


7 The specific leak rate can be found from the value of W/a 
of step 3 and the value of @ of step 6. 


W = ( =) (a@) 
a 


= (0.2520)(0.722) 


W = 0.1819 


8 The values of the leak rate and the mass velocity can be 
calculated directly from the specific leak rate and the respective 
definitions of these quantities. 


DISCUSSION 
A. Egli? 


The author’s procedure may be summarized as follows: 
Introducing a slightly different definition of specific leak rate 


W* = wy (6) 
g 


2 Engineering Research and Advanced Product Study Office, Ford 
Motor Company, Dearborn, Mich. Mem. ASME. 
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the author started out by representing his tested leakage by the 
following equation 


W* = af (7) 


The contraction coefficient @ he determines from a single 
“blade”’ test and plots it as a function of Reynolds number 


a= re )sia: le test 


(8) 


as shown in his Fig. 5. 

The function ® = ®(r; N) is computed from his equation (4) 
depending on pressure ratio r across the whole labyrinth and the 
number of stages N. 

In an attempt to correlate equation (7) with test results he 
runs into difficulties forcing him to two changes. 

First, he changes the tested relation (8) to 


a* = geometry) (8a) 
Second, he changes equation (7) to 
W* = a*f(d) (9) 


Both relations are represented graphically: 

(8a) in his Fig. 8 

(9) in his Figs. 7 and 9 

According to the author’s statements this method of represen- 
tation correlates about 200 test runs within 3 per cent. While this 
result seems to justify a computation method I still feel obliged to 
criticize this analysis as not rationally founded. 

Let us consider the contraction coefficient which represents the 
“vena contracta’”’ of the fluid jet emerging from each constriction 
in the labyrinth. In general the jet contraction will depend on 
three influences: 


1 The constriction geometry, represented in the case investi- 
gated by the two ratios 
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2 The compressibility of the fluid, conveniently represented 
by the pressure ratio r across the constriction of the labyrinth. 

3. The viscosity of the fluid, conveniently represented by an 
appropriately defined Reynolds number Na., for instance, by 
equation (3) of the author. 

With the author’s test rig it appears entirely feasible to properly 
obtain separately the three influences on a. 


Reynolds number, for instance, can be varied (independently 
from r and seal geometry) by changing the size of the system. 
In the present case it will be perfectly good enough to vary the 
dimensions s, h, ¢ while keeping their ratios constant. A range of 
system sizes of 4 to 1 should be entirely feasible (the cylinder 
diameter, of course, need not be changed). A kinematic viscosity 
variation is conveniently obtained by varying the test pressure 
level (leaving pressure ratios unaffected). A range of at least 2.5 
to 1 should be obtained. Thus a variation of Nx. in a range of 
10 to 1 could be accomplished. Such tests for a single constric- 
tion are straightforward. The program and analysis for multi- 
stage labyrinths is a little more tricky. One would limit the 
study at first to possibly three constrictions, just to get an op- 
portunity to obtain the effect of carry-over of velocity energy 
from one constriction to the next. 

Consulting published data of effect of Reynolds number on 
orifice flow coefficients one would, however, suspect that Reynolds 
number is not an important factor in labyrinth leakage predic- 
tions. In faet, it is the very purpose of a labyrinth to obtain re- 
sistance to flow without having to depend on the viscosity of the 
fluid. If the design situation is such (very narrow clearance, very 
many constrictions) that really low Reynolds numbers (in the 
order of 1000 or less) can be attained then it is usually proper to 
change over to a bushing type or surface friction type seal with 
much reduced leakage. 

Compressibility obviously affects the vena contracta through a 
sharp edged orifice or slot as has been observed by many experi- 
menters. The effect in fact is such as to even wipe out the critical 
pressure ratio point in the mass flow versus pressure ratio curve. 
The discusser strongly suspects that the variation of @ shown in 
the author's Table 1 is primarily due to the compressibility effect 
and only relatively little of it is caused by fluid viscosity. 

Construction geometry affects @ of a single constriction as is 
known from work with orifice flow measuring techniques. In the 
straight through type arrangement of the multistage labyrinth 
the amount of velocity energy carried in from the previous con- 
striction will depend largely on the spacing to clearance ratio 

The pressure ratio function, such as @ used by the author, is 
supposed to express the thermodynamic functioning of the 
labyrinth which involves two processes taking place at each con- 
striction: Production of a high velocity fluid jet and reconversion 
of a part of its energy back into potential energy. 

The author computed @ from equation (4) taken from Egli’s 
paper. This equation is certainly not accurate enough once the 
pressure ratio across one of the constrictions (the last one) drops 
below about 0.8. And it could not be considered rational to re- 
arrange the discrepancy between test and computation as a 
viscosity effect. 

Concluding this discussion I should like to propose that the 
tests made by the author could be, quite rationally, expressed in 
the following manner: 


W* = alo (10) 


where the contraction coefficient @ is a function of the seal 
geometry and the pressure ratio 
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a = a (geometry, r) (11) 


The effect of the seal spacing arrangement is represented by C as 
a function of geometry and number of constrictions 


C = C (geometry, N).* (12) 
The pressure ratio function 
@ = (r; N) (13) 


to be fairly realistic may have to be built up graphically. 
Reynolds number has been left out of this representation as it is 
the discussers’ opinion that the viscosity effect is too academic a 
refinement in labyrinth calculations. 


William P. Foster: 


The author’s method should materially reduce the amount of 
seal testing required and expedite evaluation of new configura- 
tions. 

A number of similar air leakage tests on static straight pass 
labyrinth seals have been run in our laboratory. These tests con- 
firmed that, in general, close tooth spacing is inefficient. 

We find, however, that tooth height is also a factor which 
affects flow and that there is apparently an optimum tooth spacing 
to height ratio. For example, a four tooth straight pass seal 
with a 0.2-in. tooth spacing and a 0.25-in height was better than 


* Reference might be made to a recent analytical computation of 
the carry-over effect by K. H. Groddeck, Hannover. A summary 
of his thesis is published in “‘Forschung auf dem gebieie des Ingenieu- 
rwesens,"’ 1957, no. 5, pp. 183-185. 

‘Engineer, Bearings, Lubrication & Seal Dev. Unit, SAED 
Mechanical Systems, General Electric Company, Lynn, Mass. 
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a 0.15 or a 0.4-in. height. A 0.1-in. spacing 0.2-in. height seal 
leaked appreciably more than the apparently optimum 0.2 spacing 
0.25-in. height combination. 

It is interesting to note that increasing the tooth spacing from 
0.2 to 0.4 in. and holding the height at 0.25 in. caused the leakage 
to increase slightly below 0.020 in. radial clearance. This is in 
violation of the general rule on tooth spacing but it does tend to 
show that an optimum tooth spacing to height ratio exists. 

It also confirms the author’s conclusions that, for the best ac- 
curacy, each seal configuration should be evaluated by testing. 

Our own leakage data are presented as a series of pressure ratio 
curves plotted versus radial seal clearance and a flow function as 
shown in Fig. 11. The flow function includes mass flow, upstream 
temperature, seal diameter, upstream pressure, and a restriction 
factor read from the curve in Fig. 12. Leakage for any configura- 
Yion tested may be quickly determined by substituting in the 
flow function and solving for mass flow. 


Author’s Closure 


Mr. Egli paraphrases the explicit relationships given in the 
paper in more general terms; concedes, with apparent reluctance, 
that the method does correlate test data; objects because the 
method does not conform to his own particular rationalism; 
and reiterates the method of reference [3] as a preferable system. 

Mr. Egli’s discussion emphasized rational and academic 
methods, so the author feels obligated to state the following 
syllogism which was evident but not emphasized in the paper. 
Major premise: Known methods did not correlate the test data— 
paragraph three of the paper. Minor premise: The method of 
reference [3] was known—list of references. Conclusion: The 
method of reference [3] did not correlate the test data. 

The purpose of submitting the paper was to introduce a new 
correlation method which did agree with the test data and which 
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had the other advantages noted. No further contribution would 
have been made by discoursing on the shortcomings of other 
methods with respect to the new data. However, a reasonable 
view of the facts would show that the method of reference [5] 
was designed to work for seals of larger clearances, blade spacings, 
and thicknesses than those tested by the author. Since it is well 
known that gas flow in very small passages usually has different 
characteristics than flow through similar large passages, one 
could not logically expect the accuracy of the correlations of 
reference [3] to tolerate extrapolation into the small clearance 
range. The size relationship is shown in Fig. 13. 

No further general comments on Mr. Egli’s discussion are 
required, but his outline of tests which might have been more 
“‘properly’’ conducted on the author’s apparatus contain some 
erroneous speculations which will be briefly noted so that other 
experimenters may be apprised of areas requiring special 
attention. 

Mr. Egli states that seal geometry is kept effectively constant 
if the ratios (S/h) and (t/h) are kept constant with the values of 
S, t, and h changing. The author found this not to be true. 

Thickness-to-clearance ratio is not a suitable single argument, 
even at a constant pressure ratio, for defining the contraction 
coefficient. This is fully evident from Fig. 18 of reference [3) 
in which Mr. Egli gave contraction coefficients for two blade 
thicknesses as functions of the clearance-to-thickness ratio. 
Mr. Egli’s figure shows that the curves coincide for the two blades 
for clearances greater than four times the thickness, but that 
separate curves are required for the two blades for clearances 
between one and four times the blade thickness. Mr. Egli him- 
self has therefore indicated the inaccuracy of this particular 
hypothesis. The author’s experiments involved clearances 
relatively smaller than those shown by Egli thereby making the 
ratio correlation even less applicable. 

The contraction coefficients can be expressed with suitable 
accuracy only by plotting an individual curve for each particular 
combination of thickness and clearance, and there is no advantage 
in the use of the ratio. These individual curves can be plotted 
against pressure ratio or against Reynolds number as shown in 
Fig. 8. The reasons for preferring Reynolds number in this 
instance are discussed in the paper. 
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Fig. 14 Seal design chart; blade thickness = 0.0055 in., radial 
clearance = 0.005 in., pressure ratio = 0.551 


As stated in the paper, the velocity carry-over effect is de- 
scribed by the slopes of the experimental curves asin Fig.7. If 
this effect can be adequately represented by a multiplier as sug- 
gested in Equation (10), the experimental curves of Fig. 7 would 
have to be straight lines through the origin. The data do not con- 
form to such a straight-line interpretation as may be seen in the 
figure. Furthermore, if any generalization is to be permitted 
the experimental correlation curves for given spacing-to-clearance 
ratios would be required to have a constant slope for varying 
blade thicknesses and clearances. The data do not conform to 
this requirement within the standards of accuracy set for these 
tests. 

Finally, Mr. Egli bewails the use of Reynolds number as a 
correlation criterion solely on the basis of his assumed lack of 
viscous effects while he also astutely notes that the determination 
of contraction coefficients in multiblade seals is rather ‘‘tricky.”’ 
Briefly, viscous effects are frequently significant in small channel 
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flow; Reynolds number is not solely a viscosity parameter but 
expresses the relative importance of inertia and viscous forces; 
and, as shown in the paper, Reynolds number similarity is the 
key which permits direct, rational definition of the multiblade 
contraction coefficient on the basis of single blade test data. 

The use of the pressure ratio function is justified by the results, 
and the relative merit of continuous annular paths compared to 
bladed seals is discussed with respect to Fig. 14. 

Mr. Foster’s comments about the effect of blade height show 
the significance of a variable which was not studied in the author’s 
test program. It was demonstrated that the blade height used 
was great enough to have no influence on the single blade con- 
traction coefficient determinations, and small enough to preclude 
bending of the thin blades from high pressure differentials. Fig. 
3 is a scale drawing of a typical seal configuration. 

Mr. Foster’s presentation of their test data for the leak rates 
of a four-blade seal indicates one method of plotting seal test 
data. It is interesting to note that these curves of Fig. 11 show 
the critical pressure ratio of a four-blade seal to be near 0.40, 
and that this is essentially in agreement with the prediction of 
Fig. 1, even though the seal has relatively few blades 

The data plot of Fig. 11 is designed to allow ready determina- 
tions of the effects of clearance and pressure changes on the opera- 
tion of a fixed seal design. The method used by the author to 
present data which could not be correlated by the method of the 
paper is shown by Fig. 14. This plot is designed to allow selec- 
tion of optimum seal geometry for a fixed clearance and pressure 
ratio. 

The plot refers to a fixed clearance, fixed blade thickness, and 
fixed pressure ratio and shows the leak rates for various numbers 
of blades at a range of interblade spacings. It is possible to 
superimpose lines of constant total seal length on this plot as 
shown by the broken lines. These lines then allow a designer to 
optimize the configuration which will fit into a fixed total length 
of seal. It is evident that less leakage will result from fitting 
six or seven equally spaced blades into a 0.50-in. space than 
would result from using only two or three blades. It is further 
evident that there is little value in putting more than seven 
blades into the same space, since no reduction in leak rate would 
result. 

The intersections of the broken lines of constant total seal 
length with the ‘‘zero’”’ spacing axis show the experimentally 
determined leak rates for uniform annular passages of the clear- 
ance and length shown. Thus a uniform annular passage 0.50 
in. long and 0.005 in. high would leak 50 per cent more than 
would six equally spaced blades placed in the same space with 
the same clearance. As long as the blades are thin and rectangu- 
lar and the clearances small, blades are preferable to continuous 
annular passages even if the total seal length is as small as 0.040 
in. and the pressure ratio as high as 0.83. 
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Hydrostatic Gas Bearings 


Orifice-regulated hydrostatic gas bearings offer significant advantages for instrument 
applications. In particular, gimbal bearings for inertial guidance systems can be 


designed with negligible starting torque and high stiffness, and for operation at extreme 


temperatures. 


A literature search revealed the lack of convenient and accurate data for the design 


of hydrostatic gas bearings of various configurations, taking into consideration the 
effects of compressibility, which cannot be neglected at higher pressures. 

Based on Euler’s equation, expressions for the significant parameters, i.e., pressure 
profile, gas-flow rate, gap height, and load-carrying capacity of pad and step bearings, 
are developed. These parameters yield results which are in excellent agreement with 


experimental data. 


Introduction 


NCREASING INTEREST has recently been focused on 
lubrication of bearings with compressible gases, which has in- 
herent advantages over lubrication with incompressible fluids. 
Frictional torques and losses of bearings are proportional to the 
viscosity of the lubricant. The viscosity of gases and vapors is 
lower, by several orders of magnitude, than that of oils or greases, 
resulting in a substantial reduction of running friction torques 
and losses. The starting torque of a well-designed hydrostatic 
gas bearing is practically zero if coulomb or dry friction is 
eliminated. Gases are well suited for lubrication at high tem- 
peratures at which hydrocarbons will become unstable. Con- 
tamination and soiling can be avoided by the use of gases or vapors 
as lubricants. 

Air-lubrication was first suggested by Hirn [1]? in 1854. During 
the succeeding half century, it was practically forgotten until, in 
1897, Kingsbury published the results of his extensive experi- 
ments with an air-lubricated journal bearing [2]. Since then, the 
state of the art has progressed at an accelerating rate, and the 
growing interest in this form of lubrication is reflected in the 
steadily increasing number of publications dealing with the sub- 
ject. (See Bibliography.) 

1This paper presents the results of one phase of research per- 
formed at the Jet Propulsion Laboratory, California Institute of 
Technology, under Contract No. DA-04-495-Ord-18, sponsored by 
the Department of the Army, Ordnance Corps. 

? Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of THe AMERICAN So- 
CIETY OF MECHANICAL ENGINEERS, and presented at the ASME- 
ASLE Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, Decem- 
ber 12, 1958. Paper No. 59—Lub-1. 


Nomenclature 


The test fixture incorporates pneumatic loading by means of a 
bellows-suspended piston which is prevented from cocking by an air bearing. 


A survey of the literature revealed much useful information 
for the design of gas bearings, but also the lack of complete design 
data and of a rigorous treatment of hydrostatic bearings with re- 
striction of the gas flow by single or multiple orifices. Compressi- 
bility effects on the pressure profile within the bearing are very 
significant as far as the performance characteristics at higher 
pressures are concerned, namely, the load-carrying capacity, rate 
of gas consumption, and stiffness of the bearing. A rigorous 
analysis should, therefore, treat the gas-lubricant as a compressi- 
ble fluid subject to isothermal expansion. 


Characteristics of Circular Pad Bearings 


The study was started with an analysis of the characteristics of 
circular pad bearings. They are individually interesting as thrust 
bearings but can also be considered the basic modules of more com- 
plex bearing configurations, including journal bearings of the type 
illustrated in Fig. 1. 

A pad bearing is schematically shown in Fig. 2. A circular plate 
of radius R carries a load L and is supported by a compressed-gas 
pad in a gap. The pad, which has a height ho, is connected to a 
gas plenum P! through an orifice O of diameter D and a cylindrical 
passage of radius R,. The gas is supplied to the plenum through 
an inlet at a volumetric flow rate Q, at the plenum pressure P,). 
From the plenum, the gas flows through the orifice O and the pas- 
sage C to the center of the pad and, from there, radially outward. 
The flow through the orifice O causes a drop from the plenum 
pressure P,, to Po; i.e., the pressure at the center of the pad. 
This pressure drop, AP = P,,, — Po, is a function of the flow rate 
Q. It serves to regulate the pressure within the gas pad with 
varying load L and gap height ho. If the load increases, the gap 
decreases, and with it the flow rate Q; hence, the pressure drop 
AP becomes smaller and P» larger since P,, is constant. If the 


pi 


area of flow upstream of orifice 


'{ f orifi ‘ Per = critical pressure R = radius of circular plate 
A», = area of orifice openin : ‘ 
if ici = gas plenum pressure; absolute r radius of infinitesimal circular 
= orifice coefficien ; 
> ‘dth of infinitesimal circul pressure of the gas upstream element of pad 
dr = width of infinitesimal circular 
liao ale act P, = absolute pressure at inlet con- W,, = mass flow rate through orifice 
‘lement of pac ; 
Pr f I dition (r = Ro) Wo = mass flow rate through gap 
J = orce ‘ 
; P, = exit pressure 6p/6x = pressure gradient 
ho = height of compressed-gas pad 
F Scaiateal Pl = gas plenum AP = pressure drop 
= an integra 
_, p = pressure pw = absolute viscosity of gas 
L = load on a circular plate 
2 = volumetric flow rate = density of gas at pressure P,, 
P = pressure at radius r 
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Fig. 2 Diagram of orifice and load-plate of pad bearing 


load decreases, the process is reversed. The result is an automatic 
pressure regulation for varying load. 

The load capacity of the pad obviously depends upon the radial 
pressure distribution in the pad; i.e., the pressure profile P = 
fir). It can be calculated from Euler’s equation for steady one- 


Journal of Basic Engineering 


Schematic of hydrostatic gas bearing 


dimensional flow with friction, neglecting the body forces of 
weight and inertia: 
bp 


6x by? (1) 
in which p denotes pressure; yu, absolute viscosity of gas; 5p/éz, 
pressure gradient; and u, velocity in direction of flow. 

If Euler’s equation is applied to an infinitesimal circular ele- 
ment of the pad with radius r, width dr, and height ho, and if iso- 
thermal expansion of the gas is assumed, the volume rate of flow 
and the pressure inside of the pad can be found from the equation 


bp Py (2) 
P 
where Q) is the volume flow rate and P» the (absolute) pressure at 


inlet condition (r = Ry), and P is the pressure at radius r. Inte- 
gration fromr = Ry tor = R yields an expression for Qo 


th, (Po? — Pi? 
Qo = 
R ( ) 


(3) 


and for P; i.e., the radial pressure distribution in the pad, 


r 
In 
| Ro | 
P=P,\1- 
(7) 
n Ro 


The mass flow rate through the gap Wo = poQo, or 


T 
W, = (5 
R 


lr 
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The mass flow rate through the gap must, for reasons of con- 
tinuity, equal the mass rate of flow through the orifice since the 
two are in series. The mass flow rate through the orifice can be 
expressed as 


War = Pp, — (6) 


where 


(7) 


(4 ) /a 
( k :) Bn 
k (4: y 
1 


and Cp denotes the orifice coefficient; P,,, the absolute pressure 
of the gas upstream (i.e., in the plenum); p,;, the density of the 
gas at pressure P,,; Ao, the orifice opening; Aj, the area of the 
flow upstream, r = Po/P,,; and k = c,/c,, the ratio of the specific 
heats of the gas at constant pressure c, to that at constant volume 

By equating the expressions for Wo [Equation (5)] and W,, 
[Equation (6)], ho is obtained: 


= Jin 


| Ry Py = Pot 
Po 


Po/P), and W,, = where Q,, is the 
volume rate of flow, the gap height ho, can also be written 


(8) 


Since po = 


> 
he = Jin —* — 
Po 
This expression permits the calculation of ho for varying pres- 
sures Py and a given orifice configuration and plenum pressure 


P 


(9) 


pl 
If a short nozzle or orifice of well-rounded approach is used, the 
rate of flow increases until the velocity of the downstream jet 
reaches the velocity of sound in the gas. The critical pressure at 
which this occurs in an orifice of small diameter in air (k = 1.4) is 
Poer = 0.528P 5). 
For values of Py lower than P,.,, the flow rate Q remains con- é 
stant, Q,, = Qer, and in the critical range (0 < Po < Per) the gap 
height hocr becomes 


9 > 
0 2 1 


The pressure distribution, rate of gas flow, and gap height can 
thus be rigorously calculated for any value of Py) between the 
plenum pressure P,,, and exit pressure P,. 


Load-Carrying Capacity of Bearing 

The load-carrying capacity of the bearing as a function of Pp is 
of considerable interest. The capacity can be calculated from 
the force balance between the load W and the force F exerted on 


R/R,= 362 
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the load plate by the pressure P of the gas. The radial distribu- 
tion of P between r = Ry andr = R is given by Equation (4). 
From r = 0 tor = Ro, the pressure is constant and equal to Pp. 
Typical pressure profiles for pad bearings with R = 0.565 in., and 
Ro = 0.0156 in. and Ry = 0.283 in., respectively, are shown in 
Fig. 3. 

Since P and P, are absolute pressures, the exit pressure P, is 
not zero but must also be considered in F, and the latter can thus 
be written 


R 
Fer +2 f Prdr — 
Re 


Inserting P from Equation (4): 


| 
R In 
P 2 
Reo R 
In 
Ro 
ay 
In the integral which forms the second term of Equation (11) 
P, \* 
1 - ( ‘) 
R 


n Ro 


is constant for a given load, and the integral can therefore be 
written 


r 
I 1—Aln rdr 
Ro 
R 


(1 Al 
1 ” Re Ro Ro 


The expression under the square root, or half-power, can be 
developed into a binomial series 
2 Ro 8 Ro 48 Ro 

which, however, is poorly convergent except for small pressures 
(P,/Po< 1). The integration has been carried out to the fifth- 
power term and to pressures up to P» =48 psig. The calculated 
values show very close agreement with measured values up to 
> pressures of approximately 15 psig; however, for higher pres- 
sures, the error increases to about 15 per cent at 48 psig, as would 

be expected from the poor convergence of the series. 
Fortunately, a closed solution for the integral J can be found 
which yields very good agreement between theory and experi- 


ment. 
( r \'/s 
t= {1-—Aln— 


By substituting 
in the expression for 7, the integral can be written in the form 
2 P P, 
J=- Ody ith = — 
2 1 1 
ry V27A 
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The error function Erf is available in tabulated form [3]. With 
this closed solution for the integral J, the load-carrying capacity 
F can be written in the form 


F = Ete ) — Erf (> (12) 


or 
= Ast. (+) my ( _ 
(13) 
where 
r Re’ 4 A, = 


The expression 


has been plotted for various ratios of R in Fig. 4 as a function of 
P, for pressures Py between 8 and 48 psig. From this dimension- 
less presentation, the load F, which a circular bearing of radius R 
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Fig. 5 Load-carrying capacity of step bearing with +R? = 1 in.® 


and step radius /%) can carry at various pressures Po, can readily 
be determined. 

Fig. 5 shows, as an example, the load-carrying capacity of a 
circular bearing of one in.? area for a range of R/Ry from zero to 
100 and a pressure range from 8 to 48 psig. It is interesting to 
note how sharply the maximum-load capability increases at low 
ratios of R/Ro, corresponding to large step areas. However, this 
gain in lift is paid for by a decrease in gap height ho which, with 
increasing step radius Ry, soon becomes too small for safe opera- 
tion of the bearing. 


Tests 


In order to evaluate the theoretical findings and to compare 
them with experimental data, a test fixture was built which per- 
mits the accurate measurements of the pertinent data of different 
bearing configurations. The design of this test fixture was de- 
liberately held flexible enough to allow for the testing of pad, step, 
journal, spherical, and conical bearings, as well as for experiments 
with various gases and vapors. 

Design of Test Fixture. The test fixture consists of an upper load 
and a lower orifice plate which are separated by an adjustable gap 
(see Fig. 6). The orifice plate is attached to a pleaum chamber 
which is connected to a regulated and filtered gas supply and 
mounted on a tool-maker’s microscope table. The table can be 
accurately leveled by means of three leveling screws and aligned 
with the load plate by means of a pair of x-y lead screws with 
micrometer adjustment. 

The load plate is connected to a piston and prevented from tilt- 
ing by an air-bearing, which surrounds and guides the piston. 
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The piston is connected by means of bellows to an air chamber 
and can be loaded with varying pressure from a regulated air 
supply. The load on the load plate can thus be read on a carefully 
calibrated pressure gage connected to the air chamber. Calibra- 
tion was accomplished by interposing a strain-gage-type dy- 
namometer between the load plate and the table. 

The air chamber, piston, and air bearing are mounted on a 
support plate, which can be adjusted by a lead screw along a 
sturdy vertical support column (see Fig. 7). 

The orifice plate is fastened to the plenum chamber with a 
clamp ring and can be easily replaced by other plates with dif- 
ferent orifices and contours. Likewise, pistons with different 
load plates (cylindrical, spherical, or conical) can be connected 
to the air chamber and their behavior studied under varying con- 
ditions of load and pressure. 

Fig. 8 iliustrates the orifice plate of a pad bearing, and Fig. 9 
shows the orifice plate of a step bearing with a 2:1 recess. Spirally 
arranged tap holes can be seen on the lapped surfaces of both orifice 
plates; they serve to manometrically measure the radial pressure 
distribution within the pad and to establish pressure profiles. 

As a further example, Fig. 10 shows a semicircular load and 
orifice plate, with 9 orifice holes spaced 15 deg apart, on which the 
behavior of journal bearings with multiple orifices can be studied. 

Fig. 11 shows the piston-bellows assembly with a circular load 
plate. 

The distance between load plate and orifice plate; i.e., the gap 
height ho, can be measured optically by means of a high-power 
microscope with a micrometer eyepiece. The microscope and a 
light source for illumination of the gap can be seen in the photo 
of the complete test fixture (Fig. 12). The same photograph also 
shows the instrument panel with pressure gages, flow meters, and 
pressure regulators. This optical method of measuring the gap 
proved quite satisfactory down to gap heights of approximately 
300 yin., but was unsatisfactory below that height. A capacity 
bridge was, therefore, specially developed (see Fig. 13) with which 
the capacity of the load-orifice plate cavity could be accurately 
measured. From this measurement, the gap height was calcu- 
lated and checked by means of optical measurements with the 
microscope. This electrical method proved to be highly sensitive, 
the resolution being approximately one yin. 

Experimental Data. ‘The results of a series of measurements of ho 
and Q on three orifice plates of outer radius R = 0.565 in. (xR? = 
1 in.*) and step ratios of R/Ry = 2, 3, and 36.2, respectively, are 
plotted against P, in Figs. 14 to 17. The plenum pressure in these 
measurements was varied from 8 to 48 psig in steps of 8 psig. 
The agreement between experiment (crosses and circles) and 
theory (solid lines) is very close, considering the limitations of the 
experimental method. 

It is interesting to note in Fig. 17 how the plot of the flow rate 
Q at a plenum pressure of 32 psig reflects the critical operation of 
the orifice. The critical pressure P., for 32 psig, or 46.7 psi abso- 
lute, is Per = 0.528 X 46.7 = 24.6 psi absolute or 9.9 psig. Fig. 
17 shows that, for pressures P, below this value at a plenum pres- 
sure of 32 psig, the flow rate remains constant as the theory de- 
mands, 

In Fig. 18, the gap height Ao of the bearing with R/R, = 36.2 
is plotted against the load W. The measurements were again 
made at various plenum pressures between 8 psig and 48 psig and 
show how the stiffness of the bearing increases with increasing 
pressure. If the load, for instance, is 5 lb the spring rate of the 
bearing operating at a plenum pressure of 32 psig is 6650 Ib/in.; 
it increases to 17,300 Ib/in. at 40 psig and to 20,800 Ib/in. at 48 
psig. 

Fig. 19 shows the results of measurements of the flow rate Q 
with varying load on the bearing with R/Ry = 36.2 at plenum 
pressures from 8 to 48 psig. 

The calculations of the load-carrying capacity [Equations (12) 


Journal of Basic Engineering 


“A 
AY 
| 
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through (14)] were checked by measuring the load which results 
in closing the gap. Closure loads measured on the three bearings 
with ratios R/R» of 2, 3, and 36.2, respectively, are plotted in Fig. 
20 and agree very well with the theoretical prediction represented 
by the solid lines. 

It is interesting to compare the actual load-carrying capacity 
of a bearing with radius R with the maximum load which the 
bearing could support if the pressure P did not drop from Ps, to P; 
between Ro and R but remained constant at a value Py. This 
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Fig. 14 Gap measurements, pad bearing, R/R, = 36.2 
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ratio is sometimes called the “efficiency’’ of the bearing; it is, of 
course, a function of R/ Ro, but varies little with Py. The efficiency 
of the three bearings with R/Ry) = 2, 3, and 36.2 is 61.5, 49.5, and 
22.0 per cent, respectively, at Po = 40 psig; and 59.5, 46.5, and 
19.0 per cent at Py = 20 psig. 


Conclusions 

It can thus be concluded that the theoretical expressions 
derived for ho, Q, and F describe the performance of hydrostatic 
pad and step bearings very accurately within the range of pres- 
sures and step radii investigated. 
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Fig. 17 Pad bearing, Q = f(P,), rate of gas flow 


The study of hydrostatic gas bearings is now being extended to 
journal bearings of the type illustrated in Fig. 21. Results of the 
theoretical and experimental investigation are to be published. 
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The author has presented a clear discussion of the externally 
pressurized circular-plate thrust bearing which is orifice compen- 
sated. He has made a unique contribution in conducting care- 
ful experiments to verify the known equations for this type of 
flow. 

A word about the limitations involved is in order. Inasmuch 
as fluid inertia terms have been neglected, the equations lose 
validity in the region of the inlet hole as the inlet pressure is in- 
creased or as the film thickness is increased. In addition, the ex- 
istence of sonic velocity at the inlet radius Ro, followed by super- 
sonic velocities, and eventually shock waves was first recognized 
by Deuker and Wojtech,‘ and has since been analyzed by others. 
Since the expansion occurs so rapidly in the inlet section, the flow 
there is more nearly adiabatic whereas throughout the remainder 
of the space between the two surfaces it approaches isothermal. 

The effect of fluid inertia is of great importance for externally 
pressurized bearings, not only because of the modifications in the 
equation of fluid motion, but also because it can induce turbulence. 
For example, transition from laminar flow can occur with Reyn- 
olds numbers as low as 550 based upon the film thickness and 
average film velocity. The applicability of these limitations 
must be assessed before the equations in this paper are used. 

Although the paper concentrates upon the orifice compensated 
bearing, as illustrated in Fig. 1, the pressure and load results as 
given in Section 3 depend only upon the pressure at the inlet 
section. For example, the left half of Fig. 3 shows a configuration 
which is compensated only by the change of film thickness around 
the periphery of the inlet hole. Because of the absence of a 
trapped air volume this configuration has good stability char- 
acteristics. 

This discusser has conducted tests on externally pressurized 
flat-plate thrust bearings by fixing the plate which contains the 
inlet hole and moving a plate containing a one-mil diameter 
pressure sensing hole so that pressure at any radial position may 
be measured. The small pressure sensing hole is necessary in 
order to detect the large pressure gradients which can occur in 
the vicinity of the inlet hole. The pressures can fall below am- 
bient pressure in a distance of a few mils. Too much modifica- 
tion of the flow, as well as effectively averaging the pressure, 
occurs if the pressure sensing holes are large. 


Author's Closure 


I appreciate the comments of Dr. W. A. Gross. 

As far as the limitations of the analytical treatment chosen in 
the paper are concerned, it is the opinion of the author that it is 
not only valid as applied to bearings which are of interest to 
instrumentation but also to many other practical applications of 
externally pressurized gas bearings. 

The assumptions made in the analysis are that the flow is in the 
laminar regime throughout the film, that inertia forces are neg- 
ligibly small compared with viscous forces and that the gas expands 
isothermally. The very good correlation of the experimental and 
theoretical data is proof that these assumptions are fully justi- 
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fied in ‘the case of orifice regulated hydrostatic bearings used as 
pivot axis supports in inertial and similar instruments which op- 
erate with clearances of approximately 1.5 mil and less, orifice 
diameters of about 4 mil and supply pressures up to 50 psig. 
The effects of fluid inertia, turbulent flow, and shock waves are 
undoubtedly of considerable academic interest; however, as far 
as practical applications are concerned, the author cannot agree 
that it is of great importance. As Comolet® and others have 


5 Raymond Comolet, “Ecoulement d’un Fluid Entre Deux Plans 
Parallels. Contribution a l’Etude de Buttees d’Air,”’ Service de 
Documentation et d’Information de 1’ Aeronautique, Paris, France. 
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shown, the load carrying capacity approaches zero or can even 
become negative when critical values of the Reynolds and Mach 
numbers are exceeded; the bearing, therefore, loses its usefulness 
completely. Clearances of 4 to 20 mil and more, which have 
been used by several authors to demonstrate these abnormal 
effects, at high supply pressures and excessively large flow 
rates, are hardly of interest to the designer of bearings for 
practical applications who is concerned with satisfactory load car- 
rying capacity, maximum possible stiffness, and economy of gas 
consumption. 


Transactions of the ASME 


» 
5 
; 
i 
, 
} 
4 
: 
{ 
\ q 
a 
as 
' 
i 
4 
4 


R. A. BAUGHMAN 


Manager-Materials Applications Unit, 
General Electric Company, Flight 
Propulsion Laboratory Department, 
Cincinnati, Ohio 


Effect of Hardness, Surface Finish, and 
Grain Size on Rolling-Contact Fatigue Life 
of M-50 Bearing Steel 


The effect of hardness, surface finish, and grain size upon the compressive rolling-con- 
tact fatigue strength of M-50 bearing steel has been studied. Considerable testing on the 
RC Rig and statistical-treatment methods have been included. A mathematical expression 
relating these variables to life expectancy is presented and the optimization of these 
variables is discussed. It is shown that bearing fatigue of M-50 increases by increasing 
hardness, decreasing surface, and increasing grain size. The optimum life identified 
occurs at R64 hardness, 1.5 rms surface finish, and a grain size of ASTM 2. 


ae set for antifriction-bearing procure- 
ment, including hardness, grain size, and surface finish are based 
primarily on results obtained through experience on full-scale 
bearing tests. This technique is satisfactory and has resulted in 
excellent progress in bearing development. It is, however, falla- 
cious to assume that bearing tests are the ideal manner to study 
isolated variables because of the numerous other variables which 
are part of the manufacturing cycle and which may affect life. 
The difficulty of isolating one particular variable for the purpose 
of evaluation by bearing tests is overwhelming. The develop- 
ment of the RC Rig! has made possible the isolation and study 
of one variable at a time. In order to determine the significance 
of these variables as related to bearing life for design and manu- 
facturing purposes, the primary and secondary effects of these 
variables upon fatigue life must be separated and analyzed. This 
is necessary in order to establish optimum conditions for these 
variables. Stress level is another variable included to introduce 
some flexibility to the applicability of the results. 

In order to separate the effects of the variables upon fatigue 
life, a statistical approach is used. A complete description of 
the statistival techniques employed in designing this statistical 
experiment and in analyzing the statistical results is beyond the 
scope of this paper. It is sufficient to note, however, that the 
several variables affecting life are separated by various regres- 
sion methods and by advanced methods of order statistics. 

The material used for this investigation was a Vanadium Alloy’s 
air-melted heat of M-50, heat no. 52006. A series of heat-treat 
studies was conducted in order to obtain the desired range of 
hardness and grain size. Austenitizing was accomplished at 2050 
F with time increasing for increased grain size. Double-temper- 
ing treatments were always used with temperature increased to 
decrease hardness. Hardness was checked by macro and micro- 
hardness tests, grain size by metallographic examinations. To 
obtain the surface-finish range desired, different grades of grind- 
ing wheels were used on the centerless grinder.*. Metallographic 
1R. A. Baughman, “Experimental Laboratory Studies of Bearing 
Fatigue,"” ASME Technical Digest, Mechanical Engineering, March, 
1959, p. 94. 

? The RC Rig utilizes cylindrical specimens 3 in. long and ?*/¢ in. 
diameter. 

Contributed by the Lubrication Division of THe American So- 
CIETY OF MECHANICAL ENGINEERS and presented at the ASLE- 
ASME Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 
of the Society. Manuscript received at ASME Headquarters, August 
18, 1959. ASME Paper No. 59—Lub-11. 
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polishing techniques were employed on extremely fine surface- 
finish levels (1.5 rms). Surface finish was checked via standard 
profilometer instruments. 

To eliminate the effect of dissimilar variables in the RC Rig 
rollers, the rollers were mated identically with each set of test 
bars. That is, at any given condition the test bars and rollers 
had identical grain size, hardness, and surface finish. 

Mil-L-7808 lubricant at 20 drops per minute (equivalent to 
flooding) was used throughout the test program. All tests were 
run at room temperature. 


Experimental Test Results 


The results of experimental fatigue tests are given in Table 1. 
Four tests were conducted at each combination of grain size, 
hardness, surface finish, and stress level. As expected, great 
variations in fatigue life were exhibited in spite of the fact that 
four repetitive tests were run under identical conditions. This 
variation in life (scatter) is due to the property that any finite 
collection of test results for a fixed set of conditions belongs to 
some definite population, defined by the manner in which the 
fatigue lives of the individual members are distributed within the 
population. An estimate of this population is obtained by con- 
structing a graphical picture of the data commonly plotted in 
Weibull’ form which is a plot of the per cent of tests failed as the 
ordinate versus the cycles to failure as the abscissa. This distribu- 
tion is linear when plotted on Weibull distribution paper. Values 
obtained from such plots are given in Table 2. 

The use of only four tests for each condition of grain size, hard- 
ness, and surface finish is admittedly a bare minimum. Its justi- 
fication is based upon reproducibility studies, previously pub- 
lished' on the RC Rig method for rolling-contact testing. The 
simplicity of the testing method tends to reduce the seatter and 
therefore the number of tests necessary to establish the position 
and slope of the Weibull graph. It has been observed that slope 
and position of the values studied move only slightly when based 
on more than four test points. 

In order to compare the test results on an equal basis, these re- 
sults are evaluated and compared at the same stress level. This 
was accomplished by interpolation of By and By lives of the repeti- 
tive tests between stress levels, assuming a linear variation be- 
tween stresses. The lives at 700,000 psi are tabulated in Table 3 
according to decreasing By life, and Table 4 according to de- 
*W. Weibull, “‘A Statistical Distribution Function of Wide Appli- 


eability,"’ Journal of Applied Mechanics, vol. 18, Trans. ASME, 
vol. 73, 1951, pp. 293-297. 
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Hardness Grain Size Surface Finish Maximum Berts 

Re Stress (10-3psi) 
64 7 9.35 700 
62 10d 3.75 665 
62 1/4 3.75 665 
62 Wa 10.4 14.75 665 
1.75 665 
732 
732 
14.75 732 
4.75 732 
9.75 700 
pe 9.75 700 
9.75 700 
1.5 700 

17 


9.75 

9.75 

Su 3.75 

su 3.75 

x Sh 
Su 14.75 
3.75 
5u 3.75 
Su 14.75 
Su 14.75 

£2 9.75 


5 Table 2 Tabulation of B,) and B50 and the Wiebull slope of R, Rig fatigue Table 4 Tabulated test results of By» life in descending order 
results on M-50 at various grain sizes, hardnesses, and surface finishes Grain Sise Surface Finish Marts Ben, Life”® 
No. (Re) AST™) tr 
Hard- Grain Surface q) Wiebull 
ness Size Finish Stress plife Boo Life Slope 62 3.75 


Table 1 Experimental test results 


£2 


1.121 3.190 5.620 10.429 
1.159 3.409 7.239 12.56 
6.248 12.288 22.69 
1,001 2. 6.771 
3.u69 69 11.723 
991 1.43 3.26 
3.487 4.199 6.109 
961 3.165 7.019 11.2% 
926 2 5. 8.11 
1.011 5.875 
3.960 6.105 
1.7% 4.89 10.29 
4.229 


656 1.102 2.69 2.79 “he 
697 1. 1.9% 2.69 

487 697 = 1.80 2.03 

12 1.012 1.72h 2.181 

629 2 667 

2379 1.169 1.665 

229% 712 

1435 


64 7.0 9.35 700 e 4.3 
624 10.4 3.75 665 64 3.9 
624 3.4 3.75 665 4.8 10.3 
: 62! 10.4 14.75 665 5 4.4 
! 62h 3.4 14.75 665 1.4 5.2 
24 10.4 3.75 732 56 2.3 
624 3.4 3.75 732 2.1 6.1 
, 624 10.4 14.75 732 56 4.4 
624 3.4 14.75 732 52 3.3 
58 12.0 9.75 700 78 3.3 
58 2.0 9.75 700 60 2.9 
‘ 58 7.0 9.75 700 1.0 4.3 
58 7.0 1.5 700 1.6 3.7 
58 7.0 17.0 700 2.3 5.2 
4 58 7.0 9.75 645 1.5 4.4 
’ 58 7.0 9.75 745 55 1.5 
54 10.4 3.75 665 50 1.7 
54 3.4 3.75 665 47 1.6 
54 10.4 14.75 665 32 1.30 
i 54 3.4 14.75 665 35 1.35 
54 10.4 3.75 732 14 1.00 
54 3.4 “3.75 732 25 94 
“4 10.4 14.75 732 16 38 
54 3.4 14.75 732 .25 14 
2 7.0 9. 


- Hertz stress (10-5 psi) 


Table 3 Tabulated test 


2 

1.42 3 
2. 00 

0.91 
1.55 

1.50 
2.10 6 
0.97 

1.09 7 
35 

1,34 9 
2.30 

2.19 10 
1.79 

2.21 
1.71 

1.73 12 
1.79 

1,50 
1.05 
1 

2. 

1. 

1. 


results of B;» life in descending order 


Hardness Grain Size Surface Finish Maximum Herts Byo life*® 
No. Re (aS™) (r.m.s.) Stress(1073Psi)* Cycles) 
62 14 3.75 700 x 10 


62 1/4 14.75 700 
6 58 12 9.75 700 
7 64 7 9.75 700 


® All data reduced to 700,000 psi from test date. 


** Test speed = 30,000 cycles/minute. 
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arcs The general equation relating hardness, surface finish, and 
wie grain size to fatigue life as obtained from a statistical analysis of ° 
mo the test data is of the following form: 


10 Y = By + Biz, + Bore + Bars + Bars... 
+ Buz,? + + Burs? + Bur. + 
+ Bists + + Bestors + Buty... + 


xX, 
17 x 10 
Xs 


5€ 
62 
62 
62 
6b 


creasing By life. 
dominantly dependent upon hardness while By life is dependent 
upon the interaction of hardness, grain size, and surface finish. 


General Fatigue-Life Equation 


Wa 10.4 1.75 700 4.8 x 10° 

Va 10.4 3.75 700 bol x 10 

3k 14.75 L.3 x 10 
7 9.75 700 4.3 x 10 
7 2.5 700 3.6 x 10 
12 9.75 700 3.32 10 
2 9.75 700 2.8 x 10? 
7 9.75 700 2.8 x 
3.75 1.2 10 
3.75 00 1.3 x 
14.75 700 1.0 x 10 
10.4 14.75 on x 10 


® All data reduced to 700,000 psi from test data. 


ee Test speed = 30,000 cycles/minute. 


These results indicate that By life is pre- 


same measure of fatigue life 

a function of hardness 

a function of grain size 

a function of surface finish 

a function of maximum Hertzian stress 

general coefficient to be determined by statistical 
methods applied to test data 


Transactions of the ASME 


7 
645 2.166 3.607 8.060 
1.001  1.7h0 2.69 
665 
665 
665 
665 
732 
732 
732 
732 
An 2329 1,002 
58 
se 
ca 
‘ 
Sk 
52 ? 9.75 700 x 10° 
2 «58 ? 17 700 2.1 x 16 
u 58 7 9.75 700 1.0 x 10 : 
11 3.78 70¢ 033 x 1 
2 10.75 3075 700 x 10 where 
1.78 700 we x 10 Y= 
Sh 104 14.75 700 = 
15 52 7 9.75 700 
B = aad 
\\ 
che 


5 Experiment to determine the effect of hardness, grain size, surface finish, and stress on life of bearing materials 
Sum-ery of Results 


Variables = Hardness, Rc; = Grain Size, AST; = Surface Finish, Z), = Stress 
+ Keg 4 Brat A+ + hay + fag + fag 


PSI, Max. Hertz x 10) 

4% 

Mean 

Log of Values 

15.083532 
«72147809 
21098247 
20988 33920 
227899127 
«38992290 
20474037 34 
-09281592h, 
«12882581 
2065515940 
2029184701 
2032902285 
207266280 
+0090334937 
038707809 


Y, Ye 

M = Shape Parameter ©-= Scale Paraneter 10% Life Based on 
For Weibull Distribution For weibull Distribution Order Statistics 


2. 329LL55 42691,27.7 
«13181985 245727929 559151.9 
14732217 5627L5.0 331716.4 
2076719218 42€937.9 245798.9 
2017606215 1189920,1 372022,7 
2354221,00 394921.2 35893304 
25056545 19012.3 124135.0 
235110697 399 306.2 
6529.1 027 343.0 
2068761037 639619.0 
«15323202 411832.6 307471.2 
20607 36312 55677347 123566.7 
750158.L, 332411.8 
2013692799 39818L.3 211783.7 
203517L8L8 64293749 216232.5 

Identification of Variables 


Arithmetic Mean 
3786878.8 
2212030.3 

468309.3 
361559.3 
1062018.1 
329260.1 
16010.3 
371253.5 
10873.9 
538209.0 
343922.9 
504,700.5 
671914.7 
335794.6 
57670k.3 


34217003 
«25952200 


Hardness - 
2/1130 

Grain Size - 6.936000 

Surface Finish ~- 9.390000 

Stress - 


8.08000 


Sane 
Same 
Same 
Sane 


Same Same 

oame Same 

Same Sane 

Sane Same 
Analysis of Variance 

MS Due to Regression 1353362928. x 10% 45666929 

38886900. x 104 


1.21756h29 
«13562000 

£9645 324632 

Significance Level 5% us 1c"; 


17.COSL6L2. x 104 2025601928. x 103 
9091LL500. 103 
2.2280 


Almost 10% 


~68829200 
258L 2474 
2.663L 
10% 


MS Error 0193501600 491042500. x 104 


F Ratio 3.4203 2.33% 


As can be seen from this equation, both the independent and the 
interaction effects of hardness, grain size, and surface finish are 
considered. This equation also includes the maximum Hertz 
stress as a variable. 


Rc =ROCKWELL"C" HARDNESS GS*GRAINSIZE ASTM S.F SURFACE FINISH (rms) 


6s.7 
STRESS 700,000 psi 


Results of Statistical Analysis (B,,-Life Equation) 


The results of the statistical approach are tabulated in Table 5. 
As can be seen from the table, the various coefficients of the gen- 
eral life equation are evaluated for different measures of fatigue 
life. One commonly used measure is the By fatigue life. The 
By life of a collection of tests is the life up to which 10 per cent of 
tests in a population will have failed. This life is of primary in- 
terest in that we are generally more interested in being able to 
predict the occurrence of early failures rather than the magnitude 
of the mean life. While it is not possible to predict the By life 
from a given number of failures with the same degree of confidence 
as that obtained for the mean life, the statistical results of this 
analysis are at a significant level of almost 10 per cent. 

Representative graphical presentations of the results of the 
B,-life equation are shown in Fig. 1. 


sTRESS 


54 58 2 
HARDNESS (Rc) 


+ 


4 
SURFACE FINISH (rms) 
SF 37S 


Bio Fatigue Life x10"* Cycles 


o-n we we 


MAXIMUM HERTZIAN STRESS 


Graphical representation of By fatigue life equation 


2 4 6 8 Ww 

ASTM GRAIN SIZE 
Fig. 1 
equation are shown in Fig. 1. As can be seen from the curve, 


this mathematical expression predicts lives which are negative 
for various combinations of the test variables. This equation 


Discussion of Results Predicted by the B,,-Life Equation 


The effects of hardness, stress, surface finish, and grain size 


upon the By fatigue life of M-50 as expressed by the By-life 
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also predicts an increase in life as the stress level increases for 
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Based Upon 

Tape 
minating %, 
Ao 6568917 

Ar »1327350k 

: A? - 

Az 069088010 
As 
Ai, 
4 
6S 104 
| i2 3 + - - 
4 
2 Rc 

| 
\" 

4 


other combinations of these variables. These peculiar results 
may be attributed to the fact that the initial general equation is 
of a quadratic form for each variable. In effect, this assumes 
that each variable produces a life which passes through either a 
minimum or a maximum as the variable is monitorically in- 
creased. 

Graphical comparisons of the test results and the results ob- 
tained from the empirical By-fatigue-life equation are made in 
Figs. 2, 3, and 4. 

Fig. 2 shows that a close correlation exists between the mathe- 
matically predicted effects of grain size upon the By life and the 
effects of grain size obtained experimentally. 

The effect of surface finish upon life, however, is not clearly 
defined. Although Fig. 3 shows a good correlation between the 
mathematically predicted and experimentally obtained lives, 
only Fig. 3(d) substantiates the result that a minimum life occurs 


B,o FATIGUE LIFE 
CYCLES 3 


O94 86h OF 4 8 
GRAIN SIZE 
—TEST RESULTS ----COMPUTED LIFE 


Fig. 2 Comparison of computed and experimentally obtained B,, fatigue 
life versus grain size 


© = TEST POINTS 


by H+ STRESS 732,000 Psi — 


| | 
| 
t+ +4 


o 486 2 & 


_ SURFACE FINISH 


Fig. 3 Comparison of computed and experimentally obtained B,, fatigue 
Jife versus surface finish 


o= TEST POINTS ] 


Bo FATIGUE LIFE 
xX 


Re 

0 75 SF. 14.75 
00. 700 600 600 000 800 


MAXIMUM HERTZIAN STRESS 110°? Psi 


Fig. 4 Comparison of computed and experimentally obtained B,, 
‘fatigue life versus maximum Hertzian stress 
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at a surface finish of about 10 rms. It is felt that these predicted 
minimum lives are greatly influenced by a set of high lives ob- 
tained at a surface finish of 17 rms (see Table 3). Since only one 
set of test data is obtained at this relatively coarse surface finish, 
any conclusions based predominantly upon this data must be 
questioned. 

From Fig. 4, we find that the equation relating stress to life is 
not valid for certain combinations of grain size, hardness, and 
surface finish. In view of these shortcomings, the discussion of 
the results must be restricted within certain limiting values of 
each variable. 

The discussion of results which follow are valid for hardnesses » 
greater than 54 R., grain size larger than ASTM 10, and surface 
finishes better than 10 rms. In order to simplify the By) equation, 
the stress variable is eliminated by evaluating all lives at a 
modified Hertz stress of approximately 700,000 psi. The result- 
ing equation is called the Modified-B,-Fatigue-Life equation 7 
since it eliminates all of the stress-dependent terms. 


The Modified-B, ,-Fatigue-Life Equation 
In order to obtain our modified equation, the general life equa- 
tion is rewritten as follows: 
Life = Bo + + Buri + + Burs)... 
4+- 22[B2 + Boore + Bars] + + Burs)... 
+ + Burs + Bore + Bours] 


At a maximum Hertz stress of 696, 440 psi (assumed to be 700,- 
000), X, = 0, Table 4. If, therefore, the By lives are evaluated 
at a stress of 700,000 psi, the modified equation reduces to: 
By life (700,000 psi) = By + Buri + + eee 
+ + Boor, + Burs)... 
+ + Burs] 


which is limited to: (a) Maximum Hertz stress of 700,000 psi; 
(b) hardnesses greater than 54 R,; (c) grain size larger than ASTM 
10; (d) surface finishes better than 10 rms, because of the statisti- 
cal data used in obtaining the equation. 

Values of X,, X2, and X; are tabulated in Table 6. 


Table 6 Conversion table 


3 
Re G.S. S.F. x 10° 
50 2.177130 i -1.892105 600 
51 -1.907683 2 -1,573354 2 610 69335 
52 =1.63823 3 1.25160 3 620 
- 

53 -1.368789 = 0935852 630 -2 070183 
54 5 - 5 640 -1.75859 

28298 6 6 7009 


57 = .2910025 2339151 8 = 670 - 62383 
58 = 9 657902 9 = 4079953 680 .5122h8 


2247891 


12396 690 = .2006619 
60 «517337 1.29540 1 032717 700 
61 78678 12 1.6144 2 530341 710 2422521 


1 0 . 
63 1.32567 2.2516 730 1..0b 5685 
2.57040 15 1.14003 740 1.357275 


6s 1.86457 16 2.68915 16 1.332 750 1.6688 
2.13401 3.20791 17 1.54640 760 2.036531 
3.526610 2.29203 


3.84512 2 60%2 
69 2.Wi235 20 4.168163 20 2.156102 7 2.91520 
3.2267 


72 3.461252 22 .80166 
72 3.75069 23 5.12016 
73 2k 
7% 4.28959 25 5.757916 
4.55903 

76 4.62648 


76 § 6097933 


69062738 


Transactions of the ASME 
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Quantitative Results of Modified-B,,-Fatigue-Life Equation 


The graphical representation of the effects and interactions of 
hardness, grain size, and surface finish upon the Byo fatigue life on 
M-50 bearing material are shown in Figs. 5 to 17. Figs. 5 to 9 
represent the interaction effects of grain size and surface finish 
upon fatigue life at different levels of surface finish. Figs. 10 to 


2 Grain Size - Max. Hertzian Stress 700,00 psi 


a4 
4.0 |} 4 
Bio FATIGUE LIFE 3.2 4 
CYCLES 
28 | +4 
1.6 
rms 
68 70 


Fig. 5 By fatigue life versus hardness 


stress 700,000 psi 


stress 700,000 psi 
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Fig. 6 By fatigue life versus hardness; grain size 4, maximum Hertzian 


Fig. 7 By fatigue life versus hardness; grain size 6, maximum Hertzian 


14 represent the interaction effects of surface finish and bardness 
upon life at different grain-size levels. Figs. 15 to 17 represent 


the interaction effects of surface finish and grain size upon fatigue 
life at different levels of hardness. These results are graphically 
summarized in the grain size, surface finish, and hardness “sensi- 
tivity curves,”’ Figs. 18 to 21. 


MAXIMUM STRESS - 700,000 psi 
26 
22 
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“HEMT & 
HARDNESS Rc 
Fig. 8 B, fatigue life versus hardness 
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HARDNESS Re 
Fig. 9 By fatigue life versus hardness 


HARDNESS Rc 
Figs. 10. and 11 By life versus hardness 
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Discussion of Qualitative Results Obtained From Modified- 
B,.-Fatigue-Life Equation 


Although the quantitative results obtained from the formula 
are based upon a maximum Hertz stress of 700,000 psi, the quali- 
tative results indicate the following effects of hardness, grain size, 
and surface finish upon the By life of M-50. 

Effect of Hardness Upon the Bi) Fatigue Life of M-50 (For Hardnesses 
Greater Than 54.R,). For hardnesses greater than 54 R,, the By 
fatigue life of M-50 increases with increasing hardness, reaches a 
maximum life at an optimum hardness, and then decreases. This 


Surface Finish 6rm-s. tom Hertz Stress 700,000 psi 
36 
ti 


32-774 


Bio FATIGUE LiFe 2-8 
xIO-SCYCLES a4}- 


20 


54 56 58 60 62 70 
HARDNESS Re 


Fig. 12 By fatigue life versus hardness 
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Fig. 13 Bi fatigue life versus hardness 
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Fig. 14 By fatigue life versus hardness 
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effect is shown in Figs. 5 to 14. This optimum hardness is not a 
constant but varies with surface finish and grain size. This effect 
is shown in Fig. 22. 

Effect of Grain Size Upon the Bio Fatigue Life of M-50 (For Grain Sizes 
Larger Than ASTM 10). For grain size larger than ASTM 10, the 
By fatigue life increases as the grain size increases. The By life 
approaches a constant maximum value at the large grain sizes 


Re 54 MAXIMUM HERTZIAN STRESS 700,000 psi 


iii ti 

1 t t + 

|_| | + + + + > + 

(682,46 

2 & | res 

23456789WN MH 


Surface Finish (rms) 


Fig. 15 By fatigue life versus surface finish 
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Fig. 16 By fatigue life versus surface finish 
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and decreases rapidly as the grains become smaller. This effect 
is shown in Fig. 20. 

Effect of Surface Finish Upon the Bio Fatigue Life of M-50 (for Surface 
Finishes Better Than 10 rms). For surface finishes better than 10 
rms, the By fatigue life increases as the surface finish imp-oves. 
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Fig. 19 By life versus finish for various hardnesses and grain sizes 
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Fig. 20 By versus hardness for various grain sizes and surface finish 
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This life increases steadily with improving surface finish and ap- 
proaches a minimum at a surface finish of 10 rms. This effect is 
shown in Fig. 21. 

Interaction Effects of Grain Size, Surface Finish, and Hardness Upon 
the Byo Fatigue Life of M-5O (for Hardness > 54, Surface Finish <10 rms, 
Grain Size > ASTM 10). Within the limiting regions of each varia- 
ble mentioned above, the interaction effects between the variables 
may be best explained in terms of the “sensitivity’’ curves. 
These curves represent a measure of the sensitivity of fatigue life 
upon two of the variables at different levels of the third variable. 
The most significant results can be seen from the “hardness- 
sensitivity’ curves in Figs. 20 and 21. These curves show that 
at the high hardness level of 62'/, R,, the By life is extremely sen- 
sitive to grain size and surface finish. The area under this curve 
is a measure of this sensitivity. This sensitivity decreases at the 
intermediate hardness level of 58 R,, and the By life becomes in- 
sensitive to variation in grain size and surface finish at the low 
hardness level of 54 R,. Results obtained from the grain-size and 
surface-finish sensitivity curves do not exhibit this same drastic 
interaction effect, although decreased ranges in predicted life 
occur as the grain sizes decrease and the surface finish becomes 
poor. This effect is shown in Figs. 18 and 19. 


Summary 


The effects of hardness, grain size, and surface finish upon 
fatigue life of M-50 are evaluated within certain ranges of each 
variable. The experimental and computed results compare 
favorably at the various hardness, surface-finish, and grain-size 
levels, Fig. 23. It should be noted, however, that this comparison 
is based upon limited data, which is highly statistical. In addi- 
tion, further approximations are introduced into this analysis by 
converting all the test data to a stress level of 700,000 psi. 
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Fig. 22 Effect of grain size and surface finish upon optimum bearing 
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Fig. 23 Comparison of computed By lifes and experimental results 
(maximum Hertzian stress 700,000 psi) 
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DISCUSSION 
Thomas L. Carter and William J. Anderson‘ 


The author presents some very interesting data on the effects of 
hardness, surface finish, and prior austenite grain size on rolling 
fatigue. The effect of hardness, in particular, on rolling fatigue 
has been the subject of several investigations with both simplified 
fatigue tests and with full scale bearing tests. In unpublished 
work completed several years ago, a bearing manufacturer found 
that the fatigue life of SAE 52100 bearings increased with hard- 
ness to the highest hardness tested (Rockwell C 64). In work at 
the NASA Lewis Research Center, tests in the fatigue spin rig 
and the Five Ball Tester on M-50, Halmo, M-1, and WB-49 alloys 
indicate that for each alloy life improves with hardness. In no 
instance was there any significant dropoff in life with increasing 
hardness. Prior austenite grain size was on the order of 6 to 8 
and surface finish approximately 1 to 2 microinches rms for all 
four alloys. Maximum hardnesses tested were as follows: 


M50 63.5 Re 
Halmo 62 
M-1 66 
WB-49 68 


With all of these data it was noted that the rate of life increase 
fell off with increasing hardness. Therefore, if a second degree 
curve were to be fitted to any of these data, it would predict an 
optimum hardness, perhaps beyond the range of attainable hard- 
nesses with a particular alloy, but nevertheless an optimum. 
The author’s statistical approach reveals an optimum hardness 
(at the best surface finish and grain size) beyond the hard- 
ness range of his actual tests. It is somewhat questionable 
whether this optimum hardness actually exists. 

The author assumes a linear variation in life with stress to 
interpolate between stress levels. In most rolling fatigue tests, 
life varies inversely as the ninth power of stress. Stress-life 
exponents calculated for the various groups of life data obtained 
at two stress levels vary from 1.2 to 13.3. If each of these sets 
of four data points is considered significant, then this scatter in 
stress-life exponent is worthy of further investigation. This 
scatter may stem partially from the wide confidence bands that 
result when only 4 data points are used to form a Weibull plot. 

Although, as the author states, the procedures used in the 
statistical analysis are beyond the scope of the paper, a reference 
should be given so that interested readers may follow the analysis. 


Daniel Lundquist® 


The author has utilized a simulated rolling contact bearing test 
rig in order to explore the effect on fatigue life of variations in 
surface finish hardness and grain size. The simulated rig is 
uniquely adaptable for securing a large number of data in a short 
period of testing time, thereby allowing the researcher to obtain 
data on rolling contact fatigue which would otherwise be ex- 
tremely costly and time consuming. The author and his as- 
sociates are to be congratulated on the analysis of the inter-re- 
lated effects of three variables which are thought to be of impor- 
tance in determining fatigue life. Itis hoped that other metallur- 

4 NASA, Lewis Research Center, Cleveland, Ohio. 


5 Assistant Chief Engineer-Administrative, Marlin-Rockwell Cor- 
peration, Jamestown, N. Y. 
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gical and dimensional variations can be studied at a later date. 

The effect of surface finish on rolling contact fatigue life has 
been the subject of investigation in full scale tests on ball and 
roller bearings. These tests have been conducted at compressive 
stress levels considerably below those reported in this paper. At 
these levels, tests on 35 millimeter bore bearings conducted at 
Marlin-Rockwell Corporation have revealed that variations in 
finish ordinarily encountered in bearings are not highly signifi- 
cant. Ground, ground and polished, honed, and etched surfaces 
have been investigated. 

The compressive stress levels of 732,000—645,000 used in the 
reported tests are higher than the vield point of M-50. There- 
fore it might be expected that the surface finish of thes pecimen 
could change due to a burnishing action. It would be interesting 
to know if the surface finishes changed after a period of operation 
in the rig. 

It would also be of interest to learn if the surface finish values 
were measured both axially and circumferentially on the test 
bars and rollers. It is possible to conceive of a surface which 
would contain circumferential grooves. Such a piece might have 
a good finish measured circumferentially, but would cause high 
stress concentrations in the contact area. 

It is noted that in order to compare the test results on an equal 
basis that the By and By lives for various stress levels were con- 
verted to new values based upon a linear interpolation between 
tests at different stress levels. 

From full scale bearing tests and from other simulated rig 
tests, an inverse cubic relationship between life and load has been 
established. This means that the life should vary inversely as 
the ninth power of the stress for the type of contact used in this 
test. By linear interpolation, a By life of 4.4 million cycles at a 
stress of 645,000 psi and a By life of 1.5 million cycles at a stress of 
745,000 psi convert to a By life of 2.8 million cycles at 700,000 
psi. By ninth power interpolation, the Bs life is 3.2 million 
cycles. 


Author’s Closure 


The relationship between stress and life in bearings has been 
noted to vary inversely as the ninth power of stress. not linear. 
This important relationship has been observed and verified by 
unpublished experimental RC Rig data obtained by the author. 
The linear interpolation used for the benefit of this statistical 
analysis yields results only slightly varied from the ninth power 
interpolations and, for the rather narrow range of stresses used 
in this experimental work, is within the experimental error intro- 
duced from the test results. We do not wish to convey an ex- 
pression of disbelief in the ninth power relationship, however, to 
point out that for this analysis, use of the linear relationship was 
satisfactory. 

Wear track surface finish, measured both transverse and longi- 
tudinal to rolling, was found to improve slightly during running. 
For example, the rms 17 surface finish was rms 10-12 after long 
running. Similar improvements were observed for all levels of 
surface finish tested. This results from a burnishing action be- 
tween rollers and test specimens. 

References for the statistical analysis techniques may be found 
in: E.'S. Pearson and H. O. Hartley, “Biometrika Tables for 
Statisticians,’’ Cambridge University Press, 1958; H. Cramer, 
“Mathematical Methods of Statistics,’’ Princeton University 
Press, 1951; P. G. Hoel, “Introduction to Mathematical Statis- 
ties,’”” John Wiley & Sons, Inc., second edition, 1954; J. H. K. 
Kao, “Computer Methods for Estimating Weibull Parameters in 
Reliability Studies,’ Research Report #EE344, School of Elec- 
trical Engineering, Cornell University, 1957; and M. Zelen and 
J. Fieblein, “Statistics Investigation of the Fatigue Life of 
Deep Groove Ball Bearings,’”’ Journal of Research, National 
Bureau of Standards, vol. 57, 1956, p. 273. 
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The Importance of Spinning Friction 


G. REICHENBACH 


Assistant Professor of Mechanical Engineering, 
Massachusetts Institute of Technology, 
Cambridge, Mass. Assoc. Mem. ASME 


in Thrust-Carrying Ball Bearings 


Experimental work was done rolling balls on flat plates and in V-grooves at loads and 
contact angles corresponding to usual thrust-bearing practice. 


It is shown that the 


spinning action of the ball with respect to the race should account for the major part of 


the over-all friction of a thrust-carrying ball bearing. 
tact angle, conformity, load, lubricant, and temperature. 


Variables studied included con- 
The results have been corre- 


lated and shown to follow theoretical predictions. 


Introduction 


im coefficient of friction of a ball bearing is very 
small, of the order 0.001 to 0.003. Yet even this small value is of 
great concern to designers in many applications. The main 
thrust bearing of a jet engine which is subject to extreme loads 
and speeds may absorb as muchas 25hp. The lost power is not so 
important as the problem of removing this large quantity of 
energy from the cooling oil. At the other extreme is the problem 
confronting the designer of gyroscope bearings used in inertial- 
guidance systems. He has limited power available in his driving 
motor to overcome the friction and is troubled by navigational 
errors which are an outgrowth of the bearing friction. While 
gross values of friction coefficient for many types of bearings are 
known, the basic details of the motion which lead to the over-all 
friction foree and the relative importance of the various factors 
are still not completely resolved. 

This paper will restrict itself to consideration of ball bearings of 
the thrust type. Many of the conclusions drawn will be appli- 
cable to other rolling-contact situations but will not be discussed 
here. 

If a ball in a high-speed bearing is examined, it is found to be 
under the influence of several forces, Fig. 1. 

In general, because of the centrifugal force on the ball, the 
points of contact will not be on opposite ends of a ball diameter 
but instead the outer contact angle a will be less than the inner 
angle a,;. Jones [1]' has worked out in detail methods for cal- 
culating these angles. If the relative angular velocities of a ball 
are examined with respect to a race, it is found that for a thrust- 


' Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of Tue American So- 
creTy OF MECHANICAL ENGINEERS and presented at the ASLE-ASME 
Lubrication Conference, New York, N. Y., October 20-22, 1959. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, August 3, 
1959. ASME Paper No. 59—Lub-9. 
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carrying ball bearing simple rolling motion cannot obtain. As 
can be seen from the vector diagram of Fig. 2, the ball must have 
a spinning velocity w,,; with respect to the inner race. The 
condition of outer-race control is illustrated; i.e., pure rolling at 
the outer race contact, rolling plus spinning at the inner race. 
Depending upon the relative conformities, speed, and ball size, 


(A) No Load (B) Looded (C) Looded ond Rotating 


at High Speed 
Fig. 1 Forces acting on a ball in a high-speed angular contact bearing 


8 


Angvlar velocity relationships for a ball in an angular contact 


Fig. 2 
bearing 


inner-race contact angle 

outer-race contact angle 

thrust load on a bearing 

centrifugal force of a single ball 

normal force on a ball at outer- 
race contact 

normal force on a ball at inner- 
race contact 

angular velocity of inner race 

angular velocity of a ball 

angular velocity of inner race 
with respect to a ball 
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velocity 


velocity 


F/N 


spinning component of relative 
rolling component of relative 


radius of curvature of a race 
conformity f = r/D 


sliding-friction coefficient in a 
spinning Hertz contact zone 


rolling coefficient of friction = 


force required to roll a ball on a 
flat plate or in a V-groove 

normal force at a ball contact 

rolling moment 

spinning moment 

ball diameter 

vertical distance between ball 
contact on upper and lower V- 
groove 

included angle of V-groove (for a 
flat plate 6 = 180 deg) 
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the motion may be inner-race control, outer-race control, or 
spinning on both races with neither controlling. 

The relative angular velocity between the inner race and ball 
@,), may be broken into two components, a rolling component 
and a spinning component. This leads to two sources of energy 
absorption, moments which resist, respectively, the spinning and 
the rolling angular velocities. In addition to these two modes of 
energy absorption, there are several others not directly connected 
with the rolling action. First in oil churning, a significant con- 
tribution when using oil jets to supply the lubricant in high-speed 
bearings. The use of an air-oil mist may largely eliminate this 
factor. Also there is friction at the ball-cage contact. The con- 
tribution of this has not been measured accurately to date, but 
the feeling is that the other factors account for most of the ob- 
served friction so this must be small. 

The work to be reported here concerns itself with only the 
first two mentioned losses; namely, spinning and rolling friction 
in the contact zone. 


Apparatus 


Direct determination of the operating contact angles, ball- 
spinning velocity, and so forth, is very difficult for real bearings. 
Shevchenko and Bolan [2] have done this for one case photo- 
graphically but did not have torque instrumentation on their 
apparatus. The approach used here is to examine a simple ap- 
paratus where the contact angles, spinning velocities, and rolling 
forces may be easily measured. In Fig. 1 tangent lines have been 
drawn at the two contact points of the ball. These tangents are 
not parallel and in a sense form a V-groove of angle 8. The mo- 
tion of the ball in a bearing is in many ways analogous to that of a 
ball rolling in a V-groove. In an actual bearing, however, the 
spinning is not usually equal at both contacts and the conformi- 
ties may differ at the two contacts. 

Experimentally it is fairly simple to roll balls in grooves 
and measure the resulting friction forces. Fig. 3 shows such an 
apparatus. The load is applied with a standard mechanical ten- 
sile-testing machine. The center plate is reciprocated with a 1-in. 
stroke, and force required to move the center plate is recorded 
using a resistance strain-gage ring as the measuring element. 


Fig. 3 Test apparatus for V-groove rolling experiments 
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The force was found to be independent of velocity for rates of 10 
cycles per min (cpm) or less. Two tests were run. In the first, 
the apparatus of Fig. 3 was used without any V-grooves, only flat 
parallel plates. In the second, a pair of V-grooves was inserted 
as shown. Only one set of grooves was used to avoid wedging 
effects caused by possible misalignment of multiple sets of 
grooves. The apparatus as shown is self-aligning. The balls 
and rolling surfaces in all cases were 52100 steel, hardness 58-60 
Rockwell C. The finish on flat surfaces was lapped, the balls 
had a standard first-quality finish. The finish on conforming V- 
grooves was probably the worst of the group, owing to the 
difficulty of grinding in the range 2-6 win. rms. Surface finish 
was found to influence the friction forces slightly, but has not 
been investigated in detail as yet. 

A total of six balls was used, three at each level. The load was 
assumed to be evenly distributed to each ball. This was ac- 
complished using a 3-in-thick piece of soft rubber between plates 
at the top of the apparatus to absorb any lack of parallelism be- 
tween the testing-machine surfaces, or wedging effect due to the 
motion of the center plate. There was little difficulty in repeating 
measurements from setup to setup. The testing machine indi- 
cated the same total load at any position of the middle plate. 

The variables studied were lubricant, temperature, groove an- 
gle, conformity at the contact, and stress level. 


Experimental Results 


Flat-Surface Tests. The first tests used 1'/,-in. balls on flat sur- 
faces and confirmed the results of Drutowski [3]. Defining a 
rolling-friction coefficient pz as in Fig. 4, most values of up, were 
in the range 10-*-—+ 10~*. A rolling friction of this small value 
may be accounted for entirely by internal hysteresis or damping 
of the metal. Rolling friction is perhaps more realistically 
measured in terms of a rolling moment M, defined as in Fig. 4. 
A similar total moment may be calculated for balls in V-grooves 
if the ball diameter is replaced by the distance S as shown in Fig. 7. 

Fig. 5 shows a typical curve for flat-plate tests on a log-log plot. 
Two regions are noted: An apparently elastic region below 150-lb 
load and a plastic region above 150-lb load. The dividing line 
between the two regions occurs at a stress of about 190,000 psi 
for this steel. These results are not reported or analyzed in de- 
tail as they do not differ from Drutowski’s work. These readings 
were used only to correct later readings using the V-grooves. It 
might be noted that in the usual range of viscosity, widely dif- 
ferent types of oil or no oil at all gave the same force readings 
when used on flat plates. 

V-Groove Tests. The next group of tests replaced two of the flat 
surfaces with V-grooves of various conformities. Common prac- 
tice defines conformity f as: 


TR 
f D (1) 


where rp is the radius of the race and D is the ball diameter. 

Using this definition, a flat plate has © conformity while a 
typical ball bearing has 52 — 54 per cent conformity. 

Fig. 6 shows data for various configurations. All the data have 
been reduced to a common basis of one ball with two contact 
points. The rolling moment on a flat plate is almost negligible 
when compared to balls rolling in various types of V-grooves. 
This difference is attributed to the spinning action that the ball 
has as it rolls in the groove. 

A free-body diagram of a ball rolling in a V-groove may be 
drawn as in Fig. 7. The ball is resisted in its motion by both 
spinning moments Mg and rolling moments Mp. The rolling 
moments have been determined already from the flat-plate ex- 
periments. Moment equilibrium of the ball requires 
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Fig. 5 Rolling force versus normal load for a ball rolling between fiat 
plates—1' /,-in. diameter ball 
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Fig. 6 Rolling # versus | load for several configurations— 
1'/,-in, balls, MIL 7808 Oil 
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Fig. 7 Free body diagram of moments acting on a ball rolling in a V- 
groove 


Spin Moment — pound,inch 


8-90% V-groove 


O-120° V-groove 

1 
10 100 1000 

Normal Load at Contact — pounds 


FS =4 (a1, + My, sin (2) 


Using this relation, the spinning moment has been determined for 
a variety of situations and the results are plotted in Fig. 8. On 
this log-log plot, all the slopes are approximately ‘/;, agreeing 
quite well with the analysis of Poritsky, Hewlett, and Coleman 
[4]. They integrated the friction force over the contact ellipse 
to obtain a spinning moment. A constant spin-friction coefficient 
Ms and a normal stress distribution equal to the Hertz distribu- 
tion were assumed. The result is 


3 
M;= 


where E(k) is the complete elliptic integral of the second kind, 
k = [1 — (b*/a*)]'” (3) 


a is the major semiaxis of the contact zone, and 6 is the minor 
semiaxis of the contact zone. Fora circular contact zone (b/a = 
1), this reduces to: 


3r 
= 
Ms ig (4) 


If the contact zone is elliptical with a/b > 7, less than a 2 per 
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Fig. 9 Spin friction coefficient versus mean contact stress for various 
configurations, Mil 7808 Oil 


cent error results if E(k) is taken equal to unity. For intermedi- 
ate values, E(k) must be taken from tables. 
As the radius of the contact zone varies as N'/*, it follows that 
Mg varies as N“/* as confirmed by the experimental results. 
Equation (3) has been used to evaluate the friction coefficient 
for the data shown in Fig. 6, using the appropriate normal loads 
and conformity. The result is shown in Fig. 9 where friction co- 


- efficient has been plotted versus mean compressive stress in the 


contact zone. A value of wg = 0.07 describes most of the data 
quite well. There seems to be a trend toward higher values of us 
at low stresses. However, this should be taken as tentative, 
since the data become relatively less accurate as the loads are de- 
creased. 

Examination of an actual force trace for the V-groove, Fig. 
10(a), shows a predominating square wave, as expected, plus 
higher frequency variations which seem to repeat almost exactly 
from cycle to cycle. These variations have been attributed to 
local differences in surface finish and material structure. When 
rolling on plane surfaces, Fig. 10(b), these local variations almost 
obliterate the basic square wave. Local peaks may be ten times 
the average force. Drutowski [3] has examined these local varia- 
tions in detail. 

The scatter in the friction-coefficient values does not seem to be 
entirely random. Several series of tests were run using flat-sided 
V-grooves of various angles. If the friction coefficient is plotted 
as a function. of the spin to rolling angular velocity ratio, a 
definite trend is noted in all cases. Fig. 11 shows a typical set of 
tests. Higher coefficients are associated with large relative 
amounts of spinning. This result seems logical if it is assumed 
that the friction coefficient depends strongly upon an adsorbed 
surface film which can be progressively wiped away by the sliding 
action. If there is a large rolling component, new surfaces are 
continually brought into contact, while the surfaces which have 
been sliding are exposed again to the lubricant where the ad- 
sorbed film can be renewed. If the rolling component is rela- 
tively small, surfaces within the contact area may slide a much 
greater distance as a result of the spinning before they are again 
exposed to the lubricant supply. 

Lubricant Tests. The use of V-grooves proved to be quite a 
sensitive method for determining the friction properties of various 
oils. Fig. 12 shows the force versus normal load for a variety of 
oils. These values are all reproducible except for the “dry sur- 
face.’’ The exact manner and care used in cleaning the surface 
gave rise to large variations in this result. The data suggest that 
the running torque of angular contact thrust ball bearings should 
be strongly dependent upon the type of lubricant. 

A calculation was made to see if spinning friction could account 
for the heat generation of a 240-mm aircraft turbine-type thrust 
bearing operating at 9000 rpm. The contact angles were evalu- 
ated according to the analysis of Jones [1]. The spinning and 
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Fig. 10(a) Typical force trace for V-grooves 


Fig. 10(b) Typical force trace for flat plate 


rolling angular velocities were calculated assuming outer-race 
control. The energy generated is then the product of the angular 
velocity and its resisting moment. The contribution of all ball 
contacts is added to give the total energy. Table 1 gives the 
calculated results compared with experimental values of heat re- 
jection. The experimental values have been adjusted to eliminate 
the effeets of oil churning.” 


Table 1 Calculated results compared to experimental values 


Experimental 
Heat generated Heat generated heat 

Thrust by rolling, by spinning, generation, 

load, lb Btu/min Btu/min Btu/min 

2000 7.5 82 90 
4000 8.3 159 136 
6000 15.6 225 181 
10000 36 350 272 


The spinning action by itself can account for almost all the heat 
generated by the bearing. 

Variable Temperature Tests. Work by Accinelli [5] and Kings- 
bury [6] has shown sudden rises or transitions in friction co- 
efficients as the ambient temperature is raised. To see if the 
same phenomena could be observed here, heaters were attached 
to the V-groove apparatus in order to control the temperature of 
the spinning surfaces. A once-through lubricant supply was 
arranged to avoid problems with thermal decomposition of the 
oil. The temperature was then cycled up and down with a cycle 
time of about 2 hr. The maximum temperature was limited by 
annealing of the 52100 steel surfaces. Fig. 13(a) shows the results 


2? These experimental values have been measured by Pratt & 
Whitney Aircraft using an extrapolation technique to eliminate the 
effect of churning. 
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Fig. 12 Rolling force versus normal load for several lubricants 0 = 120 
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of two such cycles. There is a rise in friction with temperature 
but the rise becomes less for repeated cycles. However, finally an 
equilibrium curve is attained as in Fig. 13(b). This improvement 
with a number of cycles is attributed to a gradual buildup of a 
varnish film with low friction properties. However, the sharp 
rise in friction still occurs in about the same temperature range. 
This rise has been attributed to desorption of the lubricant film. 
If this rise should occur in a high-speed bearing, the whole process 
may snowball, the friction rise generating more energy resulting 
in a further temperature rise, and so on, until finally the bearing 
is destroyed. The critical-temperature-range rise varies greatly 
for different lubricants. Fig. 13(c) shows another example. Fig. 
13(c) is especially interesting as there is a sharp break in the rising 
curve at 350 F. At the same time the oil began to smoke and de- 
compose rapidly forming a varnish film on the surface. The re- 
duction in friction is attributed to the boundary lubricating ef- 
fects of this film. It is doubtful that a film of this nature would 
form rapidly enough to be helpful in a high-speed bearing. 


Conclusions 


The important conclusions to be drawn here is that in many 
types of rolling contact the major part of rolling resistance is due 
to the spinning action of the balls. The so-called rolling-friction 
moment which has been attributed to hysteresis within the metal 
is at least an order of magnitude smaller than the spinning 
moment. While the rolling moment is known to be relatively in- 
sensitive to lubrication, the spinning moment being due to actual 
sliding within the contact zone exhibits all the characteristics 
of sliding friction; i.e., various oils may cause large changes in the 
friction coefficient; high temperatures may cause desorption of 
the lubricating oil film, resulting in abrupt changes in friction 
moment; the friction coefficient is independent of the contact 
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stress although the moment itself varies as the ‘/;-power of normal 
load due to geometrical considerations. Changes in geometry of 
the bearing which affect the amount of spinning will result in 
corresponding variations in the over-all running torque. 

These experiments were carried out under low-speed conditions 
and one might argue that at higher speeds hydrodynamic forces 
in the contact zone may come into play and complicate the pic- 
ture. While this is certainly true to some extent, the work of 


june 1960 / 299 


° 

° 
02 

= i6 

} 

0 N 

0 400 L 

re) 3 

° ° 
18 

04 

| 


Accinelli, et al. [5], showed that high-speed bearings, when 
operated with varieties of lubricants, showed this same sharp in- 
crease in friction with temperature. In fact, the silicone oil which 
had the most favorable viscosity characteristics for maintaining 
a hydrodynamic film in the face of rising temperature was the 
first to show the sharp friction increase resulting from desorption. 

Inasmuch as many ball bearings are forced by geometrical 
considerations to have a considerable amount of spinning, de- 
signers would be well advised to remember this when specifying 
lubricants. In particular for high-temperature operation, the 
use of liquid lubricants does not seem feasible because of the 
desorption problem. The use of solid-film lubricants either ap- 
plied mechanically to the surface or chemically formed in situ 
would seem more practical. 
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DISCUSSION 
J. B. Accinelli® and J. Beaubien‘ 


The author has used an ingenious approach to the problem of 
defining the friction effect in a thrust loaded ball bearing. The 
experimental work was especially well conceived, to the point, and 
appeared to allow for good control of the significant variables. 

At Shell Development we have been aware of the possible 
effects of ball spin on the performance of very high-speed thrust 
loaded ball bearings and, therefore, have used such a bearing in our 
studies of ball bearing lubrication at very high temperature. In 
our early work, a 20-mm bearing was operated at 40,000 rpm and 
200 lb axial load which gave a calculated relative ball spin velocity 
of 19,000 rpm and a maximum Hertz stress in the contact zone of 


* Shell Development Company, Emeryville, Calif, 
‘Supervisor, Research, Shell Development Company, Emeryville, 
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about 240,000 psi. Under these conditions silicone oil did not 
support lubrication even at low ambient temperature. One con- 
clusion made at that time was that ball bearing lubrication ap- 
pears to be predominantly of the boundary type with perhaps a 
very small hydrodynamic effect. Dr. Reichenbach makes note 
of this conclusion in his paper. However, in some of our most 
recent work, which includes speeds up to 60,000 rpm (represent- 
ing a relative ball spin velocity of 53,000 rpm), we have some 
evidence that hydrodynamic lubrication plays a much more im- 
portant role. At this much higher spin velocity silicone fluid will 
satisfactorily lubricate the bearing provided an adequate flow rate 
is maintained and bearing clearances are not reduced to zero. 
In fact, under these conditions it has been possible to operate satis- 
factorily with a silicone up to 700 F. In the same series of ex- 
periments it was found that total bearing friction was the same for 
silicone, MIL-L-7808C, and a mineral oil if these fluids are com- 
pared at temperatures where their viscosities are equal. Since 
the lubricant in all our experiments was injected at the tempera- 
ture of the bearing, there was no cooling of the bearing by the 
lubricant. The failure of the bearing with silicone oil in our 
early work could have been caused by churning of the more 
viscous silicone, leading to rapid internal heating of the bearing 
and loss of clearances. 

While at the speeds given in Reichenbach’s paper the spin 
velocities are relatively low, and hydrodynamic effects are proba- 
bly small, it would be of interest to study in the V-groove ap- 
paratus several lubricants of widely different boundary film 
properties but of equal viscosity. These could be compared with 
respect to coefficient of spin friction at various Hertz stresses or 
normal load. If the lubricants compared in Fig. 12 are assumed 
to increase in viscosity from the MIL-L-7808 to the synthetic 
sperm oil in the order shown, then the effect observed could be 
explained by an increasing fraction of the load being supported 
hydrodynamically. 

The work at various temperature was most interesting. In our 
work with high-speed ball bearings, friction decreased with in- 
creasing temperature up to a critical temperature for the lubri- 
cant (generally above 600 F). We attribute this to a decrease in 
viscosity which gives lower viscous resistance and churning. 
Above the critical temperature we found that friction increased 
steadily. We attribute this to an increase in the portion of the 
load supported by a boundary film, rather than to desorption. 

The author has contributed a very fine piece of basic experi- 
mental work in an imperfectly understood area of ball bearing 
research. The ball spin effect discussed in the paper is not only 
of interest with respect to lubrication of ball bearings but also 
should be valuable to those studying ball bearing design, par- 
ticularly, where bearing fatigue is a critical factor. 


G. J. Andreini® 


The over-all complexity of ball thrust bearings has made pre- 
diction of performance in new applications a matter of “engineer- 
ing guesstimating.’’ In all cases, these predictions are based on 
empirical data obtained by experience with past applications. 
Extrapolation of these data to any great extent has led to many 
unsatisfactory applications, For these reasons, there is a grow- 
ing need for a better understanding of ball thrust bearing opera- 
tion. 

One method of tackling the problem is to break it down so that 
solutions are obtained for pertinent parts of the problem. Ap- 
plying this philosophy, Pratt & Whitney Aircraft sponsored 
this work under Dr. Reichenbach as one part of the over-all 
understanding of ball thrust bearing operation. This work, in 
combination with the work of Mr. A. B. Jones on dynamics in 


5 Project Engineer, Pratt & Whitney Aircraft Division, United 
Aircraft Corporation, West Hartford, Conn. 
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Fig. 14 Breakaway friction coefficient as a function of radial load 


ball thrust bearings, represents a major step toward understand- 
ing and predicting heat generation in bearings. 

In addition to the pronounced increase in friction with increas- 
ing spin in the contact area, work now in progress at Pratt & 
Whitney Aircraft has shown a defin'.e relationship between spin 
and fatigue life. This testing has shown a decided decrease in 
life with an increase in spin in the contact area and suggests that 
bearing geometry may be optimized to reduce spin while main- 
taining reasonable surface stresses due to applied load. 

It is hoped that other investigators will be encouraged by the 
results reported here to work on the many other significant parts 
of the over-all problem, such as effects of lubricants and their 
properties, effect and formation of hydrodynamic films, and dy- 
namic instabilities as related to ball bearings, so that the com- 
bination of all of these solutions will eventually lead to a complete 
understanding and prediction of ball thrust bearing performance. 


Paul Lewis’ 


The author is to be congratulated for work in the field of ball 
bearing friction which has not received sufficient attention. These 
comments are largely intended to offer supplementary information 
or to elicit the author’s comments on other possibilities. 

The first comment is on the relation between rolling friction 
force as a function of normal load as shown in Fig. 5. This shows 
a lower friction increase with load in the plastic region (above 150 
Ib) than in the elastic region. Because this was obtained with 
flat specimens and balls, the friction should be mainly due to 
hysteresis. It would be expected that the steeper slope should 
be in the plastie region. 

Fig. 9 shows the sliding friction coefficient as a function of mean 
compressive stress. Higher values of friction coefficient are 
suggested at lower stress levels. Fig. 14 shows the breakaway 
friction torque (based on the bearing pitch diameter) of a deep 
groove ball bearing as a function of radial load. Here also the 
over-all coefficient is very high at light loads. 

Fig. 13(a) shows the variation of the coefficient of friction with 
temperature and cycling. The reduction in friction is attributed 
to the formation of varnish. Some of our own work on a similar 
configuration has yielded Fig. 15, showing the variation of friction 
with load. The curve is similar to 13(a) and was attributed to 
initial plastic flow and establishment of an equilibrium track. 
This implies that 13(a) might be caused in part by a softening 
with temperature and plastic flow of the track. 

The final comments refer to the temperature dependence of 
friction and lubricant. Recent work with a once through system 

® Engineer, Ball Bearings, General Engineering Laboratory, 
yoy Electric Company, Schenectady, N. Y. Assoc. Mem. 
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Fig. 15 Rolling friction force as a function of load 


has indicated certain temperature sensitive regions. The region 
with the silicones occurred at relatively low temperature, in the 
range of 250 F. Further testing at 600 F with the same fluid gave 
excellent performance. Very small quantities of fluid were used, 
and it was found that quantity of flow was very critical. Other 
work at 700 F showed a definite relationship of bearing wear with 
the flow rate. 


The comments of Accinelli and Beaubien regarding high-speed 
bearings are very interesting. The work reported here certainly 
cannot explain all their results. Viscous effects were unimportant 
in this work while for 60,000 rpm bearings viscosity must be very 
important. However, room-temperature viscosity cannot be 
taken as an accurate indication of behavior in bearings. It is 
well known that pressure, temperature, and shear rate can alter 
viscosity by many orders of magnitude. However, until actual 
numbers are known for these variations, one can only speculate 
as to the reasons for the high-speed behavior noted. 

Oils of equal viscosity under ambient conditions but of different 
boundary properties were tested. Addition of 1 per cent stearic 
or palmitic acid to a white mineral oil reduced the friction forces 
by 50 per cent. Another indication of the absence of viscous 
effects in these tests was the absence of any speed effects. The 
same friction forces were noted at 6 strokes per hr and 10 strokes 
per min. 

The apparently backward behavior of rolling friction in the 
plastic zone noted by Mr. Lewis is due only to a poor choice of 
co-ordinate paper. Use of semilogarithmic paper gives the 
wrong impression at first glance. However, if the numbers are 
examined carefully, it is seen that increasing the normal load in 
the plastic zone from, say, 400 to 500 lb, increases the friction 
force by about 0.01 lb, while a similar increase in the elastic zone, 
say from 50 to 150 lb, increases the friction force only 0.005 Ib. 

Fig. 14 by Mr. Lewis shows similar friction behavior at low 
loads. However, the explanation here must be different, since 
there should be no spinning in a radially loaded bearing. Perhaps 
the rise in coefficient can be attributed to cage friction. Even at 
zero applied load, the cage will cause a significant friction moment. 

The gradual drop of friction force with repeated cycles, as in 
Fig. 13 and Mr. Lewis’ Fig. 15, may indeed be due to plastic flow 
rather than varnish formation. Actually, in some tests indenta- 
tions several wave lengths of light deep were found in the surface 
after running. 

The results Mr. Lewis’ reports with silicone oil sound very 
similar to the results mentioned by Accinelli and Beaubien. I 
can give no really good hypothesis for the complicated dependence 
on temperature, flow rate, and oil type. 
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The Effect of Ball Bearing Steel Structure 
on Rolling Friction and Contact 
Plastic Deformation 


The rolling friction, contact plastic deformation, and elastic limit were determined for 


SAE 52100 steel structures with retained austenite contents from zero to 18.4 per cent. 
The force necessary to roll a ball on a plate decreased as the retained austenite was de- 


creased. The contact stress necessary for the initiation of plastic deformation and the 


elastic limit of the material increased as the retained austenite content decreased from 


18.4 to 3.9 per cent. 


No further change occurred when the retained austenite was 


reduced to zero. The extent of plastic deformation at very high contact stresses was re- 
duced by the presence of retained austenite contents up to at least 7.4 per cent. These 
observations were applied to the problem of selecting the best steel structure for an instru- 


ment ball bearing. 


HILE IT IS GENERALLY ACCEPTED that ball bearings 
have very low friction torque, even this low torque is of impor- 
tance when the ball bearings are used in instruments of high ac- 
curacy. The friction torque of a ball bearing is the result of a 
combination of sliding and rolling friction components. This 
paper does not consider any of the sliding friction effects that 
may arise either in the contact area between the ball and race, or 
between the ball and separator, but examines only the effect of 
the metallurgical structure of SAE 52100 steel on pure rolling. 

For simplicity, the experimental arrangement chosen is that of 
a ball rolling between two flat plates. When a ball rolls on a flat 
plate the energy loss is that associated with hysteresis losses of 
the elastically strained metals [1, 2].2 The contribution of dif- 
ferential slip in the contact area is negligible as is indicated by 
the fact that lubricants have little effect on the magnitude of tie 
rolling resistance. For balls rolling in deep grooves the differen- 
tial slip becomes more important, but this effect is not examined 
in this paper. The determination of the resistance to rolling for 
the experimental arrangement employed is a measure of the effect 
of material hysteresis on instrument ball-bearing performance. 

The structures of SAE 52100 steel that were selected for this 
investigation have retained austenite contents from zero to 18.4 
per cent. The interest in these structures originated in the fact 
that for this range the hardness reaches a maximum of Re 64 at 
about 4 per cent retained austenite and drops to a hardness of Re 
58 at zero and 18.4 per cent retained austenite. While the sta- 
bility associated with the zero retained austenite structure is de- 
sirable [3], the effect of the reduction of hardness on rolling fric- 
tion characteristics and on the resistance of the material to plastic 
deformation remained to be determined. 


Procedure 
Test Samples. The plates and balls used for this investigation 


' At present, Convair, Astronautics, San Diego, Calif. 

2? Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of THe American So- 
creTY oF MECHANICAL ENGINEERS and presented at the ASLE- 
ASME Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, June 10, 
1959. Paper No. 59—Lub-2. 
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were made from high quality SAE 52100 ball bearing steel. 
Four heat-treatments were used to produce significant variations 
in structure and physical properties: 


A Austenitize '/, hr at 1550 F, oil quench 90-100 F, temper 
1 hr at 500 F. 

B_ Austenitize '/, hr at 1550 F, oil quench 90-100 F, subcool 
1/, hr at —321 F + '/, hr at 250 F, cycled 5 times. 

C Austenitize '/, hr at 1550 F, oil quench 90-100 F, temper 10 
hr at 250 F. 


D Austenitize '/; hr at 1800 F, oil quench 90-100 F, temper 1 
hr at 350 F. 


After the heat-treatment the surfaces were ground and polished 
to a surface roughness of less than one microinch in preparation 
for the rolling friction runs. Retained austenite determinations 
were made on these surfaces using a modified x-ray diffraction 
technique based on the work of Averbach and Cohen [4]. 

Elastic limit in tension was determined on specimens having a 
0.252-in-diam reduced section using a residual strain technique 
described by Muir, et al. [5}. 

For determinations of the rolling characteristics and of the 
contact stress necessary for the initiation of plastic deformation, 
the steel balls of a given structure were rolled between plates of 
the same structure. Additional data on the contact stress neces- 
sary for the initiation of plastic deformation were obtained by 
rolling a '/2-in. diam cemented-carbide ball, with an elastic 
modulus of 72 X 10* psi, on each of the plate structures, and data 
on the extent of plastic deformation at high stresses were ob- 
tained by rolling '/, and '/,-in-diam balls of cemented-carbide on 
each of the plate structures. 

Rolling Friction Apparatus. The rolling friction apparatus is 
shown in Fig. 1. The lower sample plate is located in a holder 
which is supported by a high pressure oil film and is elastically 
restrained by a pair of rolling force beams (only one of which is 
shown in the figure). The upper sample plate is located in a 
holder which is attached to the free end of the normal force beams. 
Moving the carriage along the straight ways by a hydraulic ram 
(not shown) causes the ball to roll between the plates. The re- 
sistance to rolling is measured by the deflection of the rolling 
force beams, and the load is maintained and measured by the de- 
flection of the normal force beams. The apparatus and the 
technique for measuring rolling force are described in detail in a 
previous publication [2]. 
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A rolling frictior. run consisted of rolling the ball back and forth 
between the two plates and over the same track until equilibrium 
was established and then determining the average rolling force for 
five pairs of passes. Equilibrium conditions were attained 
within five pairs of passes for the samples used. For all the 
runs the velocity of the ball center with respect to one of the 
plates was held constant at 0.03 in. per second and the length 
of the track was 1'/,in. The values of the rolling force were cal- 
culated for the contact of the ball with only one of the plates. 
Average rolling force was determined as a function of contact 
stress for each structure. 

Plastic Deformation. In addition to determining the rolling fric- 
tion characteristics of the four structures, observations were made 
of the initiation and extent of plastic deformation in the track. 
At small contact stresses no plastic deformation was evident but 
as successively higher contact stresses were applied, the plates 
developed a groove that was identified by visual observation. 
As the contact stress increased the extent of plastic deformation 
grew larger. The resulting grooves were measured with a surface 
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Table 1 Some physical properties of 
52100 steel structures 


d and tempered 
Dimensional 

stability 
microin/in. 
change in 

Elastic % 1000 
limit, Hardness Retained at 165 F and 
Structure psi Re austenite 20,000 psi 
A 123 ,000 58 0 7-18 

B 127 ,000 64 3 
Cc 58, 000 62 7 
D aS 59 18 


4 
4 


profile instrument and the extent of plastic deformation was de- 
termined for various contact stresses on the four structures. 


Results and Discussion 


Physical Properties. The physical properties that were deter- 
mined for these four structures of 52100 steel are presented in 


Fig. 2 Average rolling force versus contact stress for 
various structures of SAE 52100 steel. Ball, 1/2-in. 
diam, of same structure as plates. 
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Table 1. The influence of retained austenite on the ability of the 
material to resist microplastic deformation can be seen by noting 
its effect on the elastic limit. With retained austenite levels up to 
3.9 per cent the elastic limit remains at a maximum value. How- 
ever, when the structure contains 7.4 per cent retained austenite 
the elastic limit is drastically reduced. At the same time the 
dimensional stability of the material is affected. Superior sta- 
bility is achieved when all retained austenite is removed [3]. 
With as little as 3.9 per cent retained austenite present, the dimen- 
sional stability of the structure becomes rather poor. While 
these characteristics change in relation to the retained austenite 
in a rather consistent manner, the hardness variation between 
structures changes in an anomalous way, not being consistent 
with either the elastic limit or the retained austenite. One would 
expect that as the hardness of the structure increases its re- 
sistance to plastic deformation would increase also, since hardness 
is a measure of a material’s resistance to penetration. But for 
52100 steel a high elastic limit was observed for the structure 
having a hardness of 58 Re, while a low elastic limit was observed 
for the structure having a hardness of 62 Re. This inversion of 
hardness and elastic limit has been observed before in the case 
of plain carbon steels [5]. 

Rolling Friction and Initiation of Plastic Deformation. The rolling 
friction characteristics of the four structures are compared in Fig. 
2, where the average rolling force is plotted as a function of the 
contact stress. The contact stress is the maximum compressive 
stress in the contact area and is given by 

3 P 

2 ma?’ 
in which P is the load and a is the radius of the circular contact 
area as determined from the Hertzian equations [6]. In each 
case the data were obtained by using a '/.-in-diam ball of the 
same heat-treatment as the plates. Each curve can be divided 
into three portions. The first is characterized by values of the 
rolling force so small that they cannot be measured with any ac- 
curacy with the apparatus; this portion extends from zero stress 
to a certain threshold stress that is dependent on material proper- 
ties. The second portion of the curve starts at this threshold 
stress and the rolling force increases linearly with the volume of 
stressed material until a point is reached at which plastic deforma- 
tion starts; the quantitative relationship between the rolling 
force and the stressed volume is discussed in another paper [7]. 
The third portion of the curve covers the region in which the 
contact stresses are sufficiently high to cause plastic deformation 
during initial rolling; in this region the rolling force increases as 
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Fig. 3 Avercge rolling force versus contact stress for a 
1/2-in-diam cemented-carbide ball rolling on plates of two 
different structures of SAE 52100 steel 
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Table 2 Observed stress for the initiation of plastic deformation for 
various 52100 steel structures 


Hardness % Retained PD stress, 
Structure Re austenite psi 
A 58 0 377,000 
B 64 3.9 373,000 
Cc 62 7.4 296 , 000 
D 59 18.4 160,000 


a power of the stress. The stress at which plastic deformation is 
first observed is marked by the letters PD for each of the curves. 

The location of the curves is a function of the retained aus- 
tenite content; in general, the higher the retained austenite con- 
tent the higher the rolling force for a given contact stress. The 
relationship between material structure and the stress at which 
plastic deformation was first noticed is also regular and is shown 
in Table 2. As the retained austenite content drops from 18.4 to 
3.9 per cent the PD stress is increased from 160,000 to 373,000 psi. 
A further decrease of the retained austenite to zero does not sub- 
stantially increase the resistance to initial plastic deformation. 
For the structures B, C, and D the stress necessary for the initia- 
tion of plastic deformation increases as the hardness number. 
While the hardness of structure A is considerably less than the 
hardness of structure B, the resistance to initial plastic deforma- 
tion of the two structures remains practica!ly the same. When 
structure A is compared with structure C it is evident that an in 
crease in hardness is accompanied by a decrease in the stress 
necessary for the initiation of plastic deformation, a situation 
similar to that found with elastic limit data obtained in tension. 

In Fig. 3 rolling friction results are compared for plates of 
structure A and C when run with a '/;-in-diam cemented-carbide 
ball. The change to the cemented-carbide ball, with its higher 
modulus of elasticity of 72 X 10° psi, alters the stress distribu- 
tion in the region of the contact. In addition, the rolling force is 
measured for the contact pair cemented-carbide/steel instead of 
steel/steel as in Fig. 2. While the absolute values of the rolling 
force cannot be compared with those of Fig. 2, the relative values 
of the rolling force and the stress at which plastic deformation 
starts in the plates confirm the data obtained with steel balls. 

The stress pattern is also affected by the diameter of the ball 
used in the run. In Fig. 4 the average rolling force is plotted 
against contact stress for runs using */,-in-diam balls of the 
same structure as the plates. The higher rolling force observed 
for structure D than for structure C and the values of the stress 
necessary for plastic deformation both confirm the data that were 
obtained with the '/,-in-diam balls. 
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It is interesting to compare the rolling force versus contact 
stress curves for the same material structure but for different ball 
diameters. The curves for structure C and ball diameters !/; and 
3/, in. are shown in Fig. 5. While the stress necessary for the 
initiation of plastic deformation is independent of ball diameter, 
the average rolling force for a given stress is greater for the */,-in- 
diam ball than for the '/,-in-diam ball. This difference represents 
the different volumes of stressed material involved in the rolling 
process for the two balls. The volume of stressed material is 
greater for the */,-in-diam bal! than for the !/:-in-diam ball even 
though the contact stress is the same. 

On the basis of the rolling force and initiation of plastic defor- 
mation data it is evident that structure A, containing zero retained 
austenite, is superior to the other structures for a low stress in- 
strument bearing application. Not only does this structure show 
a high resistance to the initiation of plastic deformation but the 
energy losses during rolling are considerably less than for struc- 
tures that contain retained austenite. As has been pointed out 
before, this difference in rolling friction behavior is important for 
instrument bearings because it is essential that they have mini- 
mum friction torque. Fortunately, the combination of low fric- 
tion and high resistance to the initiation of plastic deformation is 
achieved without sacrificing dimensional stability, which is a fur- 
ther requirement for a precision instrument bearing application. 

Plastic Deformation at High Stress. In addition to the measure- 
ment of the rolling friction and the initiation of plastic deforma- 
tion, the extent of plastic deformation at stresses well above the 
initiation point was investigated for the four structures. The 
depths of the grooves formed at these high contact stresses were 
measured with a surface profile instrument. Fig. 6 compares the 
groove depth formed by a '/2-in-diam cemented-carbide ball for 
these structures at contact stresses up to approximately 600,000 
psi. The points marked PD are again the stresses at which 
plastic deformation is first visually observed. In Fig. 7, the con- 
tact stress range is extended to almost 10° psi by using a '/,-in- 
diam cemented-carbide ball on the same plate structures. At 
stresses up to 400,000 psi structures A and B exhibit the least 
amount of plastic deformation. Above about 470,000 psi struc- 
ture B becomes superior, in its resistance to plastic deformation, 
to structure A. Above 570,000 psi structure A, Re 58 becomes 
inferior to structure C, Re 62. The superiority of the hardest 
structure, B, is maintained at these high contact stresses and 
the other structures resist plastic deformation in the order of their 
hardness numbers. 

These observations help to explain the previously mentioned 
anomalous relationship between hardness and the stress necessary 
for the initiation of plastic deformation, that was obtained for the 
structures at the 58, 62, and 64 Re levels. The rolling of a ball on 
a plane at stresses well in the plastic region of the material is a 
process of deformation akin to that which exists in a hardness 
test. In both cases excessive stresses are developed which severely 
deform the structure thus producing extensive work hardening 
and/or stress-induced phase transformations. At lower stresses, 
plastic deformation is just beginning and there is not enough de- 
formation to promote extensive phase transformation to increase 
in turn the resistance to macroplastic deformation. At this initial 
stage, since retained austenite is relatively soft compared to 
tempered martensite, the effect of the austenite is to lower the re- 
sistance of the aggregate ‘structure to microplastic deformation. 
Thus, as the data show, the extent of this weakening effect is a 
function of the amount of retained austenite present. On the 
basis of these data, up to 3.9 per cent retained austenite can be 
tolerated in the structure before a marked decrease in the re- 
sistance to microplastic deformation occurs. At high stresses, 
macroplastic deformation is extensive and is accompanied by the 
stress-induced transformation of retained austenite to martensite. 
In this stage the data show that the resistance to plastic deforma- 
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tion is a function of hardness. The more retained austenite 
present in the structure, at least up to a point, the greater is the 
contact stress necessary to complete the transformation to mar- 
tensite. For structure B, which contained 3.9 per cent retained 
austenite, this occurred at about 470,000 psi. For structure C, 
which contained 7.4 per cent retained austenite, this occurred at 
about 570,000 psi. 

A consideration of these findings indicates that hardness alone 
cannot be used as a basis for the selection of a heat-treatment for 
instrument bearings. In an instrument bearing application, the 
initiation of plastic deformation should be prevented; conse- 
quently, the superior material is the one with the highest elastic 
limit. From these results high elastic limit can be best achieved 
and maintained by using retained austenite content as the meas- 
uring yardstick. In the case of 52100 steel structures, those with 
no retained austenite have the best combination of elastic limit, 
rolling friction, resistance to microplastic deformation, and di- 
mensional stability. 


Conclusions 


1 The rolling friction observed for hardened and tempered 
SAE 52100 steel is related to the amount of retained austenite in 
the structure. A structure having no retained austenite has the 
least rolling friction. 

2 The stress at which plastic deformation is initiated, in roll- 
ing a ball on a plane, is also related to the retained austenite con- 
tent. The smaller the retained austenite content the higher the 
contact stress necessary to initiate microplastic deformation. 

3 The extent of contact plastic deformation which occurs at 
high stresses is related to the hardness of the structure. The 
hardest structure shows the least amount of macroplastic defor- 
mation. 

4 Maximum hardness is not necessarily a good criterion for 
the selection of materials for instrument ball bearings that are 
designed to run at low contact stresses. 

5 Retained austenite content is a superior yardstick on which 
to select and to insure high quality instrument bearing materials 
made of hardened and tempered 52100 steel. 

6 Inthe case of hardened and tempered 52100 steel, maximum 
elastic limit, dimensional stability, resistance to plastic deforma- 
tion at low stresses, and minimum rolling friction are achieved 
with a structure containing no retained austenite. 
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DISCUSSION 
Robert A. Bareiss* 


The subject matter which the authors have taken for their cur- 
rent work is of considerable importance to the manufacturers and 
users of antifriction bearings. It is therefore necessary that all 
aspects of this problem be given rigorous consideration. One of 
the major points made in this paper is the contact stress which 
causes the initiation of plastic deformation in the sample plates. 
In order to clearly establish the effects of retained austenite on 
plastic deformation, other contributing factors should be iden- 
tified and accounted for in the results of any tests. In this re- 
gard, the authors should consider the effects of residual stresses on 
the initiation of plastic deformation in their tests. It has been 
shown by a number of authors (see Bibliography at end of this 
discussion) that internal stresses do play a considerable part in 
plastic deformation of a ball on a flat plate. Tensile residual 
stresses lower the contact stress required for plastic deformation 
while compressive residual stresses tend to raise slightly this con- 
tact stress. The change in contact stress to initiate deformation 
seems to be proportional to the magnitude of the residual stress. 
Since the test specimens used by the authors have in some cases 
contained a considerable proportion of retained austenite, it 
seems reasonable to assume that residual stresses of significant 
magnitude may also be present. 
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D. G. Flom‘ 


The authors of this paper should be complimented on their con- 
tribution to bearing design and on the care with which their ex- 
periments were conducted. The important conclusion that 
hardness alone is insufficient to account for energy losses in pure 
rolling at low stress is of general applicability. In our own labora- 
tory we have found that this is true for polymeric materials and 
that elastic hysteresis losses must be considered. As a conse- 
quence of this result, temperature and speed of rolling are found 
to be very important factors. 

In view of Mr. Drutowski’s earlier work on energy losses for 
balls rolling on plates I have wondered if the authors have used 
their data to calculate specific damping capacities for the 52100 
steel as a function of retained austenite. In a very cursory search 
through the literature, I have found that studies of specific 
damping capacities for steel by more conventional techniques 


3 Vice President, Lessells and Associates, Inc., Boston, Mass. 
Mem. ASME. 


* General Electric Company, Schenectady, N. Y. 


Journal of Basic Engineering 


have been somewhat meager. In such a study one might again 
expect to find temperature and frequency effects to be important. 
In the case of rolling it would also be interesting to know if, within 
the elastic range, the dependence of friction on load and on ball 
diameter agree with the theoretical predictions of Tabor,’ 
Drutowski,‘ and Flom and Bueche.’? The latter dependence could 
perhaps be determined most easily with data using cemented 
carbide balls so that one would need to consider the deformation 
of the plate structure only. 

As the authors show in Fig. 7, the resistance to plastic defor- 
mation of the steels during rolling at high stress appear directly 
related to their hardness numbers, indicating that a stage well into 
the plastic region has been reached. However, full plasticity has 
apparently not been reached since contact stress is still increasing 
with increasing depth of groove. It would be interesting to see 
whether extensions of the curves in Fig. 7 would bend upward 
sharply at contact stresses corresponding to yield pressures ob- 
tained from hardness measurements on the respective steels. If 
such a result were found it would indicate that time-dependent 
effects were negligible for the rolling speeds used in these experi- 
ments. 

I should like to ask one question about the experimental re- 
sults. The rolling forces given in the figures are presented as 
average values. Since the retained austenite in the steels would 
presumably exist in the bulk matrix alongside relatively hard and 
brittle martensite it is conceivable that variations in rolling force 
could be detected, especially at low stress, as regions of austenite 
and martensite came under the rolling ball. Were such varia- 
tions observed or were these effects, if present, averaged out? 
If variations in rolling force existed, a correlation could perhaps 
be found by relating the distance between friction peaks and the 
per cent of retained austenite. 


Authors’ Closure 


The comment of Mr. Bareiss is a valid one and the authors 
appreciate its possible significance. In addition, it should be 
pointed out that the size and distribution of the residual micro- 
stresses should be considered in any detailed explanation of 
plastic deformation. In speaking of the effect of residual stress 
on fatigue strength, Dolan [1],* has pointed out that the size and 
distribution of the residual microstresses in a specimen are more 
important than the average value of the microstresses (the value 
commonly obtained in residual stress measurements). While 
such a detailed consideration would undoubtedly enhance the 
interpretation of the experimental results, the authors feel that 
it would not basically affect the explanation offered, i.e., the 
influence of retained austenite on the ability of the material to 
resist microplastic deformation as shown by its effect on the 
elastic limit. 

In answer to Mr. Flom, specific damping capacities were not 
calculated as a function of retained austenite. For the materials 
used, the specific damping capacity for a given ball diameter at a 
given contact stress varies directly as the rolling force. In the 
purely elastic contact region, i.e., below a load sufficient to cause 
plastic deformation, the specific damping capacity for a steel 
ball rolling on a steel plate is given by 


3.84 X 


obtained from equation (7) of a previous paper [2]. For a 
1/-in-diam ball at a contact stress of 260,000 psi the specific 


5 D. Tabor, authors’ Reference [1]. 

R. C, Drutowski, authors’ Reference [2]. 

7D. G. Flom and A. M. Bueche, “Theory of Rolling Friction for 
Spheres,” Journal of Applied Physics, vol. 30, 1959, p. 1725. 

* Numbers in brackets designate References at end of closure. 
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damping capacities of the zero and the 7.4 per cent retained 
austenite structures are, respectively, 0.06 and 0.22 per cent. 
These values are twice as large as would be obtained using 
Tabor’s [3] original expression for elastic work in rolling but agree 
to within 3 per cent of his modified expression [4]. 

Tabor [3] observed experimentally that the rolling force in- 
creases as a a power of the load, the exponent being between 1.7 
and 1.85. From theoretical considerations he predicted that at 
small strains the rolling force might be expected to increase as 
the */;-power of the load. Flom and Beuche [5], using a dif- 
ferent theoretical approach, also indicated that under certain 
conditions the rolling force should vary as the ‘/;-power of the 
load. Previously reported experimental data of Drutowski [2] 
agreed with these observations; for a cemented-carbide ball roll- 

“ing on a hardened 52100 steel plate the rolling force was found to 
vary as the 1.2-power of the load at low loads and as the 2.4- 
power of the load at high loads. The data for the steel structures 
reported in this paper disagree with the theoretical predictions. 
For the data of Fig. 2 at the high contact stresses the rolling force 
varies as the 2.3-power of the load for the 18.4 per cent retained 
austenite structure, as the 2.1-power of the load for the 7.4 per 
cent retained austenite structure, and as the 1.9-power of the 
load for the 3.9 per cent and zero retained austenite structures. 
For the range of loads at the lowest contact stresses the exponents 
increase to 3.3 for the 7.4 per cent retained austenite structure, to 
2.1 for the 3.9 per cent retained austenite structure, and to 2.9 
for the zero retained austenite structure, and this increase is in 
opposition to the theoretical expectation. When a cemented- 
carbide ball rolls on two different structures at high contact 
stresses, as shown by the data of Fig. 3, the rolling force is found 
tp vary as the 2.2-power of the load for the 7.4 per cent retained 
austenite structure and as the 2.7-power of the load for the zero 
retained austenite structure. For the lower contact stresses the 
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exponents decrease, respectively, to 2.0 and 1.8. The deforma- 
tion of the cemented-carbide ball is approximately 40 per cent that 
of the steel plates and should not be neglected; in fact, it appears 
that the contribution from the cemented carbide is enough to 
overpower the effect noted above for the steel /steel combinations. 

The authors must apologize for not making clear in this pres- 
entation that the measurements are plotted as a function of 
nominal contact stress, a practice often employed in rolling con- 
tact literature. The nominal contact stress is that calculated 
from Hertz’s formulas assuming no plastic deformation. If 
plastic deformation does occur the true value of the contact 
stress will be less than the nominal contact stress. For loads 
below that necessary to cause initiation of plastic deformation 
the true contact stress is equal to the nominal contact stress. 

During the experiments the rolling force was continuously 
recorded and the variations of the rolling force along the rolling 
path noted. The presence of rolling friction peaks for even the 
smoothest steel surfaces has been previously reported |2]. For 
the specimens used in this investigation there appeared to be no 
obvious relationship between number and/or size of the peaks 
and the per cent of retained austenite, but no detailed check 
was made of a possible correlation. 
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A General Theory for Elastically Constrained 
Ball and Radial Roller Bearings Under 


A. B. JONES 


Chief Research Engineer 
The Fafnir Bearing Company 
New Britain, Conn. 


Arbitrary Load and Speed Conditions 


A completely general solution is obtained whereby the elastic compliances of a system 


of any number of ball and radial roller bearings under any system of loads can be de- 
termined. Elastic yielding of the shaft and supporting structure are considered as well 
as centrifugal and gyroscopic loading of the rolling elements under high-speed opera- 
tion. The solution defines the loading and attitude of each rolling element in each 
bearing of the system as well as the displacement of each inner ring with respect to its 
outer ring. For ball bearings the precise location of the load paths in each raceway are 
found. Life estimates can be more accurately made since the fatigue effects can be 
evaluated over known paths in the raceways. The solution, which is accomplished 
numerically by iterative techniques, has been programmed for an IB M-704 digital com- 


puter. 


= their mechanical simplicity, rolling-element 
bearings, and in particular ball bearings, present some very com- 
plex problems when their actions under service conditions are 
studied. 

All calculations of rolling element bearing fatigue life require a 
knowledge of the bearing’s internal load distribution. Closed- 
form solutions for the forces on the rolling elements of a bearing 
are known for relatively few cases of loading. Existing closed- 
form solutions deal only with the case of parallel ring displace- 
ment and do not properly account for the change in contact angle 
found with ball bearings. Centrifugal ball loading and gyroscopic 
moments, important in high-speed bearings, are usually neglected. 

The precise determination of elastic yields, of extreme im- 
portance of the designer of precision inertial guidance system, is 
not possible with existing closed-form solutions. 

The reason for the inadequacy of existing solutions to properly 
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Nomenclature 


define a bearing’s internal load distribution lies in the complexity 
of the associated equations. Rolling element bearings constitute 
a statically-indeterminate, nonlinear, elastic system and the as- 
sociated equations are not easily reducible to closed-form solu- 
tions without certain, simplifying assumptions. 

The general equations for the elastic equilibrium of a ball bear- 
ing in three of the five possible degrees of freedom were given by 
the writer in Reference [{1].' All known, closed-form solutions 
for different cases of loading can be deduced from these equations 
by making the necessary, simplifying assumptions. However, 
precise solutions could only be obtained by tedious, graphical or 
numerical integration of the pertinent equations. 

With the advent of the modern, high-speed, digital computer, 
it became possible to quickly solve the most complex problem in 
bearing load distribution by numerical means. 

This paper extends the scope of the original theory of Reference 
{1] so that an entire assemblage of bearings, shaft, and supporting 
structure is considered as a single, elastic system. The solution 
defines the elastic compliance of a point on the shaft with respect 
to the supporting structure in five degrees of freedom. Elastic 
yielding of the shaft and supporting structure are considered 


! Numbers in brackets designate References at end of paper. 


A = a fatigue constant for ball 
bearings 
A’ a fatigue constant for roller 
bearings 
B total curvature factor 


ing inner raceway or to 
point of force application; 
L is positive when meas- 
ured left from point B, in. 


capacity of a ball or roller 
race contact for 90 per 
cent probability of sur- 


vival to 10° revolutions 


(CF), 
E 
F,, F,, F, 

K,, K; 


K,’, K,’ 


centrifugal force on rolling 
element, Ib 

pitch diameter, in. 

forces parallel to X, Y, and 
Z, \b 

outer and inner-race de- 
flection constants for ball 
bearings, in.'/* Ib~*/* 

outer and inner-race deflec- 
tion constants for roller 
bearings; units depend 
on roller design 

axial distance from point B 
on shaft to center of bear- 
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fatigue life for ball bearing, 
hr 

fatigue life for roller bear- 
ing, hr 

moments about axes paral- 
lel to Y and Z, lb in. 

gyroscopic moment acting 
on ball, lb in. 

speed of rotating race, rpm 

mounted diametral clear- 
ance in roller bearing, in. 

dynamic _rolling-element 
loads at outer and inner- 
race contacts, Ib 

static rolling-element load, 
Ib 


of inner race, lb 

orthogonal right-handed co- 
ordinate system 

semimajor axes of outer and 
inner-race pressure el- 
lipses, in. 

semiminor axes of outer and 
inner-race pressure el- 
lipses, in. 

ball diameter, in. 

ratio of transverse radius of 
ball race to ball diameter 

total number of bearings 
and points at which shart 
is loaded by external or 
body forces 
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through the use of influence coefficients. The exact, internal 
load distribution is found for all of the bearings in the system. 
Centrifugal forces acting on the rolling elements and gyroscopic 
moments acting on the balls are considered. Bearing lives are 
evaluated by summation of the fatigue effects of the rolling ele- 
ments’ passages over precisely determined paths in each bearing 
raceway. 

The present work has been programmed for an IBM-704 digital 
computer and, in addition to the elastic yields of the bearings 
and structural parts, and the bearings’ fatigue lives, yields other 
pertinent data. Among these are contact area dimensions and 
stresses at each ball position, locations of the pressure-ellipse 
extremities, centrifugal force and gyroscopic moment at each 
rolling element position, theoretical fatigue life, frictional effects 
due to rolling, and many other items of interest. 

Numerous, special-purpose programs for specific problems 
have also been developed using the basic theory outlined here. 
In several of these, the elastic deformations of the bearing rings, 
themselves, and their effect upon fatigue life have been con- 
sidered. 

The author wishes to acknowledge the assistance of Mr. George 
M. Cooley of The Fafnir Bearing Company in bringing the theory 
to its ultimate form. 


Theoretical Development 


Elastic Compliance of the System. Consider a system of p angular- 
contact ball bearings and ¢ radial roller bearings mounted on a 
nonrigid shaft and supported in a nonrigid structure. 

Let there be w points along the shaft where external loads or 
body forces are applied. 

Then the total number of points on the shaft where loads or 
bearing reactions occur isj = p + t+ w. The number of points 
at which the supporting structure is loaded is u = p + t. 

Forces and moments and bearing reactions acting on the shaft 
can have components in accordance with the co-ordinate system 
of Fig. 1. 

At each bearing position there will be structural yields and 
bearing deflections. These are referred to a similar co-ordinate 
system, Fig. 2. 

Fig. 3 shows a portion of the shaft and supporting structure in 
their deflected state in XZ projection. A similar situation exists 
in XY projection. 

In Fig. 3, u is any bearing location, B is a particular point on 
the shaft reference line. Shaft deflections and slopes are referred 


Nomenclatuze 


4 
SHAFT AXIS 
+F 
+F 
\ j 
~ ¥ 
Fig. 1 


SHAFT — 
> x 
+x 


to this reference line through influence coefficients. Deflections 
and slopes of the supporting structure are referred to the initial 
axis of the supporting structure through influence coefficients. 
é, is the compliance of the inner ring of the bearing at u with 
respect to its outer ring in the Z-direction. 

The shaft deflection at any bearing position u due to forces and 
moments in the XZ-plane is, relative to the shaft reference line 


An = [Ans + (1) 


lel to X, Y,Z 
angle between vector of 


l = length of roller, in. ball’s angular velocity 
p = number of ball bearings in about its own center and 
system X-axis, radians 
q = rolling-element position in- &,, & = angular displacements 
dex parallel to Y, Z, radians 
= subscript indicating gth roll- 8 = initial contact angie of ball 
ing-element position bearing after mounting 
r = radius of locus of ball-bear- Bo, Bi, = dynamic outer and inner- 
ing inner-race curvature race operating contact 
centers, in. angles 
¢ = number of roller bearings in 8, = static operating contact an- 
system gle 
u = total number of bearings in y = ratio d/E 
system 5. 5ig = approach of rolling element 
w = number of points at which to outer and inner races, 
shaft is loaded by exter- in. 
nal or body forces vy 8,, = angular displacements of 
%, 9, 2 = linear displacements paral- shaft at point B about 


axes parallel to Y, and Z, 
radians 


Subscripts , 


angular velocity of ball 
about own center, rad X 
sec™! 

angular velocity of spin of 
ball with respect to outer 
or inner race, rad X sec™! 

angular velocity of rolling of 
ball with respect to outer 
or inner race, rad X sec™! 

angular velocity of rotating 
race, rad X sec™?! 

orbital angular velocity of 
rolling element, rad X 
sec} 

and , refer to conditions at 


outer and inner race contacts, respec- 


tively. 
Subscript 


refers to conditions at the 


gth rolling-element position. 
Other notations as defined in text. 
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{F,,} and {M,,} are columns of forces and moments and are of 
orderj =p+t+w. 

{A,,,}] and [A’,,,] are square matrices of influence coefficients, 
also of order j. 

The slope of the shaft in the XZ-plane relative to the shaft 
reference line is 


ay, = If F.,} + yu My,} (2) 


The yield of the outer-race structure relative to the initial axis 
in the XZ-plane is 


View > [Vows + [Vous (3) 


and the slope of the outer-race supports relative to the initial axis 
is, in the XZ-plane 


Py, = Ih Fs} + IE (4) 


In the X Y-plane a similar situation exists, so that 


Aye = [Ava + Me) (5) 
Ory = + Ma} (6) 
Vou = Pod + (7) 
Pow = [Poul + Ma} (8) 


The interest is in the sum of the shaft and support structure 
yields so that 

= An + Vou = + (9) 

= yy + = Fa} + (10) 

Aye = Aye + Vou = + (11) 


= Ary Py = + }{ (12) 


where 
= [Ary] + and so forth. (13) 
Similarly, in the direction of the X-axis 
[Aca }{Fs,} (14) 
where 
= + [Veu;] (15) 


If there are initial preloads which are reactive on elastic struc- 
tural parts, then the columns of forces and moments must be ad- 
justed accordingly. 


= — (16) 


INITIAL AKES 
SUPPORTING 


OF SHAFT AND 
STRUCTURE 


Fig. 3 
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= {Fy — (18) 
{m’,} = {M,, — M’,,.} (19) 
= {M,, — M’,,..} (20) 


The items F’,,_,, and so forth, are initial loads or moments 
which exist at a bearing position u when the system is at rest and 
before the application of external load. 

If ry, Zp, Fy,, and are displacements of the shaft ref- 
erence line at point B relative to its initial position and attitude, 
the relative displacements of an inner race with respect to its 
outer race are 


= Dag t t+ [row + M's} (25) 


The items A’,,, and so forth, are initial deflections within the 
bearings as the result of preloading or the forcible application of 
initial constraints. 

Five, nonlinear, simultaneous equations define the equilibrium 
of the system. 


+ p> Pa = (26) 
> Fret Fm = 0 = (27) 
F.+ 2 F, =0=y; (28) 
> +> M+ = 0 = (29) 
2, My + LF ve + + =0=¥Ys (30) 


These equations can be linearized and solved iteratively by the 
Newton-Raphson method. 

Let 6,,° be initial estimates of the variables ry, 
and Diy in that sequence in Equations (26) through (30). Im- 


proved values are 6’,,.. 


where y, are the error functions. 
The elements a,,, of the matrix are 


(31) 


(32) 


(33) 


(34) 


(35) 


(36) 


(37) 
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Forces and Moments in a Ball Bearing Under High-Speed Conditions. 
Fig. 4 shows an angular-contact ball bearing in its initial attitude 
and the five displacements the inner ring may have with respect 
to the outer. 

Fig. 5 shows the convention of the ball position index q. 

Fig. 6 shows the initial and final positions of the inner-race 
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curvature center and ball center with respect to the outer- 
race curvature center at the gth ball position of a ball bearing at 
u. 

(AF), and (RF), are axial and radial functions which relate the 
ring displacements to the ball forces and contact angles. 


>. 


(AF), = Bd sin B + + + Gary sin—" (57) 
n n 


(RF), = Bd cos 8 + 5, sin —! + cos 
n n 


where AP p is any increase in diametral clearance of the mounted 
bearing. 

If centrifugal forces and the friction forces arising from gyro- 
scopic effects are considered, the forces and moments acting on the 
ball are shown in Fig. 7. 

The ball loads are related to the normal approaches of the ball 
to the races through 


P., = (59) 
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de = 
(Oy \ 
u 
- dz, dz, 
F M 
as = —— + (53) | | 
oF, . My | / 
so = + — 54 =n 
or (n-1) AD %= 
vu 
u | f 
| 


FINAL POS'TION 


INITIAL POSITION 


INNER RACE 
CURVATURE CENTER 


OUTER RACE 
CuevATvee CEenTee 


and = (60) 


Y At each ball position it is necessary to determine 8.4, Sig, Bogs 
and §,,. Since the iterative techniques of the Newton-Raphson 
method will be used to solve the associated nonlinear equations 
the angles 8., and 8,, are best stated in terms of the co-ordinates, 
V and W, in Fig. 6. 


Ww, 
in By. = 61 
sin (61) 
(f. — 0.5)d + 54] If 5,° are initial estimates of the variables V,, W,, 54, and 5, 
, in that order, improved values are 6,’, where 
(AF), — Wy _ 


dn = (63) 6,’ = — (69) 


(RF), — V, €, are the error functions, &, €:, &, and &, in that order, from 
— 0.5)d + (64) Equations (65) through (68). 
‘ 


cos By, = 


From Figs. 6 and 7 and Equations (61) through (64) 
(AF), — + (RF), — V,P au = —2((RF), — V,) (70) 
— ((f, — 0.5)d + =0 (65) a, = —2((AF), — W,) (71) 
Vi+We- (66) au = 0 (72) 
[x Mas) ay = —2[(f, — 0.5)d + (73) 
an = 2V, (74) 
= 2W, (75) 
[ -V,)- (AF), = —2{(f, — 0.5)d + 5,,] (76) 
— 0.5)d + an = 0 (77) 
(CF), =0=« (67) K 


Mas) — 0.5)d + — 0.5)d + 
d 


(f. — 0.5)d + 5.4) 


(78) 


[ Kar, + 


Me 1 1 
M 
[ (AF), — W,) + (RP), 3 KV — 0.5)d + — + “ioe 


— 0.5)d + — 0.5)d + 
=O0=« (68) (80) 
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CF) 
P 
Mes 
a 
- 
— 


K,((RF), — — 0.5)d + — K,((RF), — + — (AF), — W,] 


Me 1 K Ké 
= 2 4: 83 


& 
KW — 0.5)d + 6,,] — KW 
((f, — 0.5)d + 6,,}? (84) 


Kil(AP), - W — 0.5)d + — — — (RF), — V,) 


The forces and moments with which the inner race acts on the 
shaft at its center are 


Me 
] n | K,{(AF), — + —* (RF), V,] 
a [« — W 4 (RF), | q d q q 
vu — 0.5)d + 
— 0.5)d + 444] 
86 
Mg Me, | cos (89) 
n | K,{(RF), — (AF), W,] n 
Fy. 
= — 0.5)d + 844) 


, Me 
n_ | K,((AF), — + (RF), — V,) 
— 9.5)d + 


> K (RF), — (AF), — We) Me, | sin (90) 

For brevity let 6, = (%,, Fu, Zu» Byu» ey.) as the case may be. 


(AAF), 3 26, 
0.5)d + ( 26, 26, ) > 2 (AF), W,) 06, 
Ma, (XRF) ov. Me 26, 
oF > + rae 26, ~ Ob, {Ka [((AF d ((RF da 26, 


(° 25, - “((RF), — (AF), walt 


- sin (92) 
q=1 ((f; — 0.5)d + n 


(O(RF ov 3 06; 
IK; 0.5)d + dq = + 3 /21(RF Vel q 


— 0.5)d + 


— 0.5)d + 26. 26, + K6,, /*((AF), W,) 
06, — 0.5)d + 
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. 
sin ong (87) M,, =f 
n q 
F,, = 
cos — (88) 
n 
Then 
| 
4 
00, q=1 n 
4 i 4 


oR, 


02, 
HAF), 
oA F), 


oF, 
oy 


oV, oW, 


found. The solutions are 


Journal of Basic Engineering 


— 0.5)d + 6,,) 


Mag RF), 
26, 


XRF), 
(RF), 
— = cos 


ORF), 


ARF 


| (AF), — W,] 


ov, 


AAP), 
06 


‘) + (AF), — W,) 26, 


- - W,) + ‘as (RF), 


06, 


HAF), _ 


0 


n 


ang 


RF), 


n 


= 0 


0 


OAF) 
06, 


4+ (RF), V,) 


— 0.5)d + 


The derivatives of (AF), and (RF), with respect to the 6, are 
HAF), 


(96) 


(97) 


(98) 


(99) 


(100) 


(101) 


(102) 


(103) 


(104) 


(105) 


If now Equations (65) through (68) are differentiated with re- 
spect to the 6,, four simultaneous equations result which are linear 


in a5,’ 28,” 28,’ and 58 and from which the latter can be 


dé, | 
ORF) Ma OAF) 
= [bi,] 1 , 
| 23, 
| — 0.5)d + 
| Me, XRF) 
| - q q 


The elements, };,, of the matrix are 


fb. = (RF), — V,) (107) 
bs = [(AF), — W,] (108) 

bs = 0 (109) 

bu = — 0.5)d + (110) 

by = V, (111) 

ba = W, (112) 

bu = —[(f, — 0.5)d + 84] (113) 
bn = 0 (114) 


The remaining elements ba, ba, bss, bss, Dar, bee, baa, and by are, re- 
spectively, identical to 41, and ay, Equa- 
tions (78) through (85). 

The centrifugal force and the gyroscopic moment are 


(CF), = (115) 
Q 
Me - Q? sin (116) 
2/,\a/, 


C,; and C, depend on material density. Q is the angular velocity 
of the rotating race. 

The ratios (Q,/2) and (w,/Q2) depend upon which race is 
rotating and upon the type of ball ‘‘control.’’ In angular-contact 
ball bearings it is found that the ball will roll on one race without 
spin and that all spin occurs with respect to the other race. If 
no spin occurs at a race contact the ball is said to be controlled 
by that race. If gyroscopic slip of the ball is prevented, the 
ball’s angular-velocity vector lies in the plane of the bearing’s 
rotational axis and its attitude depends on the type of control. 

For outer-race control 


cin Bay 
cos + 


tana, = 


(117) 


For inner-race control 


(7), [ 


cos Ba + tan a, sin 8 


oq 


1 + ¥ cos B,, 


__Sin _ 
cos By, — (118) 
+ (106) 
Table 1 shows values of 
2 
The ratio (2s is 
J Q/, 
9 
cos §;, + tan a, sin B,, (119) 
¥ cos Bis cos a, 


june 1960 / 315 


06. 
= sin— (95) 
n 
|| 
|| 
= 
n 
= 
|| 
|| 
af 
* 
| 25, | | 
| ow, 
my 
3 
J 


Table 1 


Outer-race control 
cos (Bi, — Bo.) + cos Bi, 


Outer rotating 


Qe 
Value of ( 2 ); 
Inner-race control 
1 + 7 C08 Boy 


1 + cos (Bi, — Bog) 


1 — y cos 


1 + cos (8;, — Bo.) 


Inner rotating 


The upper sign is used with outer-race rotation; the lower sign 
with inner-race rotation. 

It is necessary to make an assumption as to the type of control. 
A check on the correctness of the assumption is given by the fol- 
lowing relations obtained from consideration of the moment re- 
quired to spin a ball relative to a raceway about the normal to the 
pressure area at its center. 

Outer-race control exists at a ball position q if 

008 (Big — 


B.,) > (120) 


Inner-race control requires that 


E(€)iq 008 (Big — Bog) > (121) 
E(¢€) is the complete elliptic integral of the second kind having 
the modulus sin € where 


(122) 


b and a are the semiaxes of the pressure ellipse at a race con- 
tact, calculated in accordance with the Hertz theory. 

Under certain conditions of operation a ball may be com- 
pletely out of contact with the inner race. At that time the outer- 
race contact angle 8,, and the gyroscopic moment M4, are zero. 
The force P,, is equal to the centrifugal force. 

Loss of inner-race contact will occur when 


(AF)*, + (RF), — (f, — 0.5)d — K,~/4CF),"*}? 


€ — 0.5)d]? (123) 

Forces and Moments in a Ball Bearing Under Static Loading. If the cen- 
trifugal force and the gyroscopic moment acting on a ball are 
negligible, the assumption 2 = 0 in the foregoing will yield a 
solution for the case of static loading. However, the computa- 
tions are unnecessarily long since many of the equations become 
redundant. 

The solution for static loading can be obtained from considera- 
tion of the potential energy in a deflected bearing. Here, how- 
ever, to preserve continuity the static case is deduced from the 
dynamic case as follows 

With Q = 0: 


P,, = Pi, = P, (124) 


Bog = Bi, ” B, (125) 


(AF), 


tar, 


(126) 


The total approach of inner and outer races at a ball position is 
6, = [(AF)* + (RF)*]'/* — Bd > 0 (127) 
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1 + cos (Bi, 


cos (Bi, — Bog) — ¥ C08 Bo, 
1 + cos (Big = Bo,) 


The ball load is 
P= K6,"* 


lent 


The forces and moments with which the inner race acts on the 
shaft at its center are 


where 


(129) 


Fy, = P, sin B, 


q=1 


2m 
= P, cos B, sin 
n 


n 
= 2. P, cos B, cos = 


q=1 


n 
9 
=F > P, sin B, cos - 
q=1 


(130) 
(131) 
(132) 
(133) 


n 
=f > P, sin B, sin — 
n 


q=1 


(134) 


The associated derivatives are 


= K >> + V, sin® B,] 


q=l 


oF 3, 
135 
or, (135) 


OF (136) 


n 
. 
= K 2, V, sin 8, cos 8, sin 


2r 
— = K > V, sin B, cos B, cos (137) 
n 


q=1 


n 

zu . q 
= rK > [U, + V, sin? B,] cos - 

= q q 


oF 


n 
. 
rK 2, {U, + V, B,] sin 


(138) 


(139) 


F 2 
=K V, sin B, cos 8, sin 


= [U, + V, cos? B,) sin* 


(140) 


(141) 


oF 
= K [U, + V, cos? sin 


n 


(142) 
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— 
ey? 
(128) 
| 
cos € F.. 
Mu 
a 
Pho 
Se 
oa 
and 


V. si in — — 143 
2a, r sin B, cos B, sin (143) 


= V, sin B, cos 8, sin? (*) (144) 
= V, sin B, cos B, cos (145) 


ary 


= K — (1 
K «cost B,] sin cos (146) 
OF 


OF >> V, sin B, cos B, cos* (*") (148) 
n 


n 
(U,+ V, cost cost ( ) (147) 


= rK V, sin B, cos sin cos (149) 


q=1 


= 2r 
+ sin® By} cos“ (150) 


q=1 


oM 2rq_ 
=rK V, sin B, cos B, sin (151) 
q= 


Ve sin cos B, cost (152) 


q=1 


vu [U, + V, sin? B,] cos* (= (153) 
r (U, + V, sin? B,] sin (154) 
=rK [U, + V, sin* B,] sin (155) 


rK V, sin 8, cos sin* (156) 


q=1 


n 
J 2 . 
V, sin B, cos 8, sin —~ cos 


2nq 
q=l n 


(157) 


n 
= r'K > U V — (158 
r 2, [U, + V, sin® sin cos (158) 


= [U, + V, sin* sin* (159) 


where 


+ 3(Bd)} 

V, = 161 

"318, + (Ba)] 

Forces in a Radial Roller Bearing Under High-Speed Conditions. 

A radial roller bearing can be loaded only by the displacements 
9, and 2, if misalignment effects are small. 

For a radial roller bearing with initial diametral clearance Pp 
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when mounted, the total compression at a rolling element is 


5, = 9, sin + 2, cos -— (162) 


5, must be positive since the roller cannot take tension. 
There exists no precise formula for calculating the approach of 
a roller to a raceway as this depends somewhat on the nature 
and degree of the crowning of the roller or raceway. However, 
an expression of the form 
Py = (163) 
Pa = (164) 
can be fitted to experimental data. 
The centrifugal force on a roller is 


(CF), = (165) 


The upper sign applies to inner-race rotation; the lower to outer. 
The total compression of a rolling element is the sum of the 
compressions at the inner and outer-race contacts. 


5, = + 4,, (166) 
also 
P., = Pi, + (CF), (167) 


K’(6, — — — (CF), = 0 = A, (168) 


if (CF), > K,’5,', P., = (CF),, and 6,, and P,, are zero. 

If b is other than unity, establish the error function A,. If 5;,° 
is an estimate of 4,,, a better estimate is bi’, where 
Ay 


= §° >0O (169) 


and so forth. 
If } is unity 


~ (CP)y] 
+ Kd 


-—(C 
= >0 (171) 
‘| 
K’, 


The forces with which the roller bearing acts on the shaft are 


>0 (170) 


Po 


F,, = 0 (172) 


9 
Pre = sin (173) 
q=l 


Qn 
= K', cos (174) 
q=l 
M,, = 0 (175) 
M,, = 0 (176) 
The associated derivatives are 
oF. 


oF ve 
Hew) 


0 (177) 
0 (178) 
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and 
oM 
oy = 
4 
oM,, 
‘ 
j 


5, — bi, 


2mq 
(196) 


Byus 


n 
OF = 6-1, sin? (197) 
2 2 14+ 
n n 
5, 1p { n = n (is 
1 q= 


6, bi, 


OM, 


dus 


oM,, 
= 185 


If b is unity 


Fw = — (CF),) sin“? (186) (202) 
K’; n 


on 


: 1 2mq = The associated derivatives are given by Equations (188) 
Fa, = ; [K’,6, — (CF),] cos — (187) through (191). 
EB + | . Special Cases of Initial Constraint. Up to this point it was as- 
, sumed that all outer rings were rigidly attached to a flexible sup- 
K’; . porting structure. There are many instances where one or more 
—— > sin? | — ‘) (188) outer rings are free from constraint in one or more degrees of free- 
oF [: + = ‘| q=1 dom or where the constraints are different than those dictated 
K’, by the structural flexibilities. 


- Some of the more common cases are treated in the following. - 
Fy ein cca (189) Single Ball Bearings With Axially Floating D's. In system hav- 
Oz, [: + | at n n ing u bearings let the vth ball bearing be so mounted that its OD 

K' is free to slide axially without restraint. 

The matrix of influence coefficients [,A.,,,) should contain 

K’, 2mq zeros in the vth row and column. 
sin (190) Let there have been made an assumption of the five displace- 
[: + ] ments 2g, and from which have been obtained 

e 


Fumv» Zumv, Xyyer, and &,-,. Then the value of %,-, is that which 
satisfies the relation 


=0=@ (203) 


If #°,» is an initial estimate of %,-., an improved estimate is 
I 
wv, Where 


Forces in a Radial Roller Bearing Under Static Load. Under static 


, ® 
(CF), = 90 (192) 
Po, = Pi, = P, (193) The derivative required in Equation (204) is given by Equa- 
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“a 
q=1 i+ — 
j K’, K’ on ‘ 
n 6-1, sin — cos — 1+ 
OF yy n n 80 K’, 4 
K’, 
4 9 
V5 Fy, = K’ | 5, sin (201) 
F, K’ 2 
— 


tion (91) with 6, = 2... for the dynamic case or by Equation 
(135) for the static case. 

The converged value of %,.., together with Jue, Zumv, 
and @,,., are used to obtain new estimates of zg, yg, 2g, Dy,, and 
v,,. During the latter process only all derivatives with respect 
to %,., are set equal to zero. 

Single Ball Bearings With a Spherical Outer Ring OD. If the vth ball 
bearing has its outer ring mounted in a spherical seat the moment 
reaction at that bearing is zero. 

The influence-coefficient matrices [:@’y4;], and 
{:@’,,,;| should contain zeroes in the vth row and column. 

For an assumption of the five displacements at point B values 
Of Fumr, aNd Z,-, are obtained directly. 

The values of &,,., and &,,_, are those which satisfy the rela- 
tions 


My. = 0 = x (205) 
=(Q= X2 (206) 


Let &°, be initial estimates of &,,_, and &,_, where j = 1, 2, 
respectively. Improved estimates are @,’, where: 


= — (207) 
where: 
i = 
= (209) 
— 210 
(3%) 
Cn = (211) 


The converged values of &,,., and &,,., are used with 2,_., 
Jw-e, and &,.. to obtain new estimates of the displacements at 
point B, and so forth. In the latter process only all derivatives 
with respect to &@,,., and &,,, are set equal to zero. 

Multiplex Bearings Which Act as a Unit and Whose ODs Are Free to 
Float Axially. Consider a system of bearings in which v ball bear- 
ings are so arranged that their outer races are axially fixed rela- 
tive to each other but free to move axially in their housings with- 
out restraint. 

Let the relative axial motion of any inner race with respect to 
its outer race be z. Then the total axial displacement of an inner 
race with respect to its outer race is %,-.. 


lune = 32 + (212) 


Assume that an initial choice has been made of the displace- 
ments at point B from which have been calculated j,-., 2,-:, 
Qyun», and &,,.,... Then the value of z is that which satisfies the 
relation 


For an initial estimate, 2°, of z an improved estimate is x’ 
where 


» Rane 


The converged value of x enables new estimates of the dis- 
placements at B, and so forth. 


Journal of Basic Engineering 


Calculation of Ball Bearing Fatigue Life. The Lundberg-Palmgren 
theory, Reference [2], is the basis for the AFBMA standard 
method for evaluating load ratings of ball bearings and is used 
by most bearing manufacturers in establishing catalog ratings. 
However, this theory is based upon the assumption that parallel 
ring displacement exists, that is, that the displacements @,, 
and @,,, are zero. Also, it assumes that, under combined load, 
the operating contact angle is the same at each ball position. 

The foregoing solution of the elastic problem yields directly the 
true ball load-distribution and contact-angle variation. By 
evaluating the fatigue effects of the ball passages over an exact 
path, a more accurate estimate of fatigue life is possible. 

Equation (115) of Reference [2] gives the capacity of a race 
contact for 10° revolutions of the inner race and 90 per cent proba- 
bility of survival. 

Recent work, as yet incomplete, has shown that the spin of the 
ball relative to a raceway has an important effect on fatigue life. 
Although this effect has not been fully evaluated, preliminary 
results indicate that an exponential relation exists between the 


capacity of a race contact and the ratio —*. Therefore, Equation 
w 


Rq 
(115) of Reference [2] should be modified as follows 


The upper sign is used for an inner-race contact and the lower 
sign for an outer-race contact. 

For a raceway in which a particular point supports the same 
load at each ball passage the life of the race in hours for 90 per cent 
probability of survival is 


10° 
— (216) 
1 P 10/, }-9 
Qe 
For a raceway in which the loading of a particular point varies 


cyclically with time, the life of the raceway in hours for 90 per 
cent probability of survival is 


slay 


10*n 
= —— (217) 


3 
won 
q=1 


The life of a complete ball bearing is 


The ratio —*, required in Equation (215) is, for an outer-race 
w 
contact 


Pos = —(1 + ¥ cos Bua) tan (Boy — a,) + ¥ sin Bus (219) 


and, for an inner-race contact 


Ws, 
=(1 — ycos tan (B;, — + sin Bi, (220) 

Ri, 

Calculation of Roller Bearing Fatigue Life. The calculation of 
the fatigue life of a radial roller bearing is greatly dependent upon 
the manner in which the rollers are crowned. Under light loads a 
continuously crowned roller may have substantially point con- 
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tact and the pressure area is elliptical. Under increasing load the 
pressure area encompasses the whole length of the roller and, at a 
certain value of load, the pressure distribution is substantially 
uniform along the roller’s effective length. At higher loading, 
concentrations of stress occur at the roll ends. For substantially 
light loads corresponding to elliptical contact areas (point con- 
tact) life is inversely proportional to the third power of the load. 
For heavier loads corresponding to line contact, life is inversely 
proportional to the fourth power of the load. Unequal load dis- 
tribution along the roller length introduces further complications. 

Most aircraft roller bearings usually have rollers which are 
cylindrical for about */; of their length. Crowning is accom- 
plished by tapering the roll ends slightly for a distance approxi- 
mately 1/. of the roller length from each end. Such bearings are 
relatively free from the harmful effects of edge loading at all 
loads, provided alignment is good. Since substantially line con- 
tact exists, the life is inversely proportional to the fourth power 
of the load. The discussion of roller-bearing life is restricted to 
bearings of this type. 

Equation (116a) of Reference [3] gives the capacity of a race 
contact for 90 per cent probability of survival to 10° revolutions 
of the inner race, for line contact, as 


Q.’ A {l + 


(221) 


The upper signs are used with an inner raceway and the lower 
with the outer raceway. 

For a raceway in which a particular point supports the same 
load at each roll passage the life of the race in hours for 90 per 
cent probability of survival is 


106 


(222) 


For a raceway in which the loading of a particular point varies 
cyclically with time, the life of the raceway in hours for 90 per 
cent probability of survival is 


10*n 


q=1 


(223) 


The life of the complete roller bearing is 
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DISCUSSION 
T. Tallian? 


The paper deals with a subject of considerable practical inter- 
est. No general method for obtaining analytically the forces and 
elastic deflections in a redundant system of radial ball and rol- 
ler bearings, supported by elastic members has been published 
be‘ore, as far as I am able to ascertain. Yet the problem, and 
several special cases thereof, arises frequently and the analyst 
seeking a solution to it will highly welcome the aid afforded by a 
generally applicable method. By making the IBM computer 
program available to the public, the author could further ensure 
general usefulness of the work done. 

With respect to the fatigue life calculations outlined in Sections 
entitled, ‘Calculation of Ball Bearing Fatigue Life’’ and ‘‘Cal- 
culation of Roller Bearing Fatigue Life,’’ p. 319, a word of cau- 
tion seems appropriate. The Lundberg-Palmgren theory, given 
as Reference [2] of the paper, is based on the assumption of 
pure normal loading of all rolling contacts in the absence of a 
hydrodynamic lubricant film affecting the stress distribution in 
the contact. Its applicability to practice, as in the AFBMA 
standard load rating method, is ensured by the empirical fact 
that, for the usual operating conditions, the actual (but analyti- 
cally inaccessible) stress conditions are proportional in their 
fatigue effects to the theoretically used stress conditions. Em- 
pirical factors of proportionality give correct orders of life 
length. 

Attempts to improve the precision of the Lundberg-Palmgren 
theory or to extend it to operating conditions for which it was 
not empirically verified, should, in my opinion, be directed to- 
ward better definition of the stress conditions actually existing. 
If, instead, one applies correction factors to the end-results as in 
Equation (215), then the formulas obtained become purely em- 
pirical, and are no longer a logical outgrowth of the theory. 
They are interpolation formulas only, applicable to conditions ex- 
perimentally elucidated. 

As an example of cases where experimental results contradict 
an extrapolated theory, I point to some experiments indicating 
that, under certain conditions of slip at the contacts, the mecha- 
nism of failure changes from fatigue flaking to surface damage. 
Clearly, a fatigue theory cannot carry over to operating con- 
ditions where such occurs. 

Regarding the Section on Roller Bearing Fatigue Life, similar 
remarks may be made. It is, e.g., experimentally established 
that crowned roller bearings will not behave as predicted by a 
Hertzian fatigue theory if lubrication is marginal. It appears 
that, in that case, lubrication breaks down at the roller ends 
even though the roller is crowned and this results in unexpected 
failures. 


2? SKF Industries, Inc., Philadelphia, Pa. 
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A Sommerfeld Solution for Finite Bearings 


J. V. FEDOR 

Naval Ordnance Laboratory, 
Silver Spring, Md. 

Assoc. Mem. ASME 


With Circumferential Grooves’ 


A method of solution is developed that circumvents the algebraic complexity in the solu- 
tion of Reynolds dafferential equation applied to full journal bearings. 


The method 


leads to equations for journat-bearing characteristics that are in finite form. Salient 
features of the complete-oil-film solution are: As the eccentricity ratio b approaches 
1, the load capacities of all finite bearings approach that of the infinite bearing; also, 
the friction curve intercept, (r/c)f, is 1 for all finite bearings when b equals 1. 
Results are compared with calculated values published by Muskat and Morgan for 
0<6<0.6. Excellent agreement is found throughout the compared range. 


|. HAS BEEN a long-standing belief of many scientists 
and engineers in the lubrication field that it is unlikely that a 
computable analytical solution to Reynolds three-dimensional 
equation for the full journal bearing would be obtained. If 
papers by Muskat and Morgan [1]* and Jacobson [2] are ex- 
amined, one is impressed by the complex nature of even approxi- 
mate solutions to Reynolds equation. When one applies classical 
methods of solution (separation of variables), the justification for 
this belief becomes apparent. A formal classical solution for 
Reynolds equation is constructed in Ref. [9] and it will not be re- 
peated here. 

The question of boundary conditions has compounded the 
problem further (see[ 3] and [4]). It is generally agreed that, with 
an adequate oil supply and relatively light loads, it is possible to 
provide sufficient oil to form a complete oil film. This forms the 
basis for the Sommerfeld boundary condition; that is, the pres- 
sure and its first derivative must duplicate themselves when going 
from 0 to 2x. The Sommerfeld boundary condition will be used 
in this paper because of the mathematical simplicity that it pro- 
duces. It is the purpose of the following section to outline the 
journal-bearing problem and present a method of solution which 
circumvents the algebraic complexities that arise in the solution 
of Reynolds equation. 


Analytical Solution of Reynolds Equation 


Reynolds equation for the pressure distribution in a full journal 
bearing is 


+ b cos 6)? | + bcos 6)? 
=. AS =. 
06 dz dz 
= (1) 


where 


P = pressure at any point in oil film above atmospheric pres- 
sure 


! Part of this paper was prepared by the author in partial fulfillment 
of requirements for the degree of Doctor of Philosophy, Mechanical 
Engineering, University of Maryland, College Park, Md. 

2? Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of Tae AmeRICAN 
Society or MecHanicat and presented at the ASLE- 
ASME Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Nore: Statements and opinions advanced in papers are to be con- 
sidered as individual expressions of their authors and not those of the 
Society. Manuscript received at ASME Headquarters, December 29, 
1958. Paper No. 59—Lub-4. 
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linear velocity of journal 

radius of journal 

difference between journal and bearing radii 
eccentricity of journal and bearing axes 
viscosity of oil-constant 

oil film thickness = c (1 + 6 cos @) 
eccentricity ratio = e/c 

angular co-ordinate 

co-ordinate in axial direction 


The co-ordinate system shown in Fig. 1 will be used in this 
analysis. 

Putting a circumferential groove into a bearing is equivalent to 
cutting the bearing in two parts and providing each part with a 
uniform lubricant pressure around the adjacent ends. The pres- 
sure distribution in the interior of the two parts will be independ- 
ent of each other and each will behave as an independent bearing 
of length determined by the total length of the bearing and the 
position of the groove. Therefore it is necessary to consider only 
end-fed journal bearings when dealing with bearings with circum- 


Fig. 1 Cross section of journal bearing 
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ferential grooves. For convenience, a journal bearmng with a 
groove located at the center of the bearing will be discussed, see 
Fig. 2. In the axial direction, the origin of the co-ordinate system 
will be located midway in the end-fed bearing. It is assumed that 
the width of the groove is small compared to l. 

First the pressure distribution must be found in the bearing by 
solving Reynolds equation. The journal-bearing characteristics 
are then obtained by integration. The following boundary con- 
ditions will be imposed on the pressure equation: 


1 The pressure function and its first derivative must duplicate 
themselves in going completely around the bearing; i.e., 
oP(0, z) oP(2z, z) 


P(0, z) = P(2m,z) and 20 30 


2 The pressure function must vanish at the end of the bear- 


ing: P(0,l/2) = 0; andatz = —//2, the pressure must be P», the 
pressure of the circumferential groove. 
‘For the total pressure function let 


= + + cos 6] sin 6 


+ b*)[1 + b cos 6)? 
— A, (2) sin (2) 
i 


It will be noticed that Equation (2) meets the first boundary con- 
dition. The first term represents the circumferential source func- 
tion, the second term is the pressure equation for the infinite bear- 
ing, P..(@), and the third term represents a correction function to 
take into account the finiteness of the bearing. It will be readily 
verified that the first two terms satisfy Reynolds equation. After 
equating coefficients of sin m@ to zero in Reynolds equation, the 
following recurrence formulas are obtained from the third term 


2(D* — 1)Ai(z) + b(D? + 2)AxAz) = 0 (3) 
and 
2(D? — m*)A,,(z) + b(D? — m? — m + 2)Am-(z) 
+ b(D? — m? + m + 2)Ami(z) =O m>1 (A) 
where 
2 


d 
D? = — 
dz? 


At first glance Equations (3) and (4) look quite formidable to 
solve. Not only is Equation (4) a three-term recurrence equa- 
tion, but it also is a differential equation. As a gambit suppose we 
let Ao(z) be some proportion of A,(z). That is, 


A,(z) = —€A,(z) (5) 


where € is a positive quantity to be determined. Making this sub- 
stitution, Equation (3) becomes 


[(2 — eb)D? — 21 + e€b)|Ai(z) = 0 
or 
(D? — a*)Ai\(z) = 0 (6) 
where 
2 — 


Equation (6) is a simple second-order differential equation with 
constant coefficients and the solution is straightforward. To 
satisfy the boundary conditions at the ends of the bearing, A,(z) 
must be even in z. Hence we take 
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Fig. 2 Diagrammatic representation of bearing with circumferential 
groove 


A;(z) = C, cosh az/r (7) 


where C; is an arbitrary constant. 
It follows from the initial assumption that A2(z) is given by 


A(z) = —e€C, cosh az/r (8) 
A,(z) is determined from the equation 
2(D? — 4)A2(z) + b(D? — 4)A,(z) + =O (9) 
and A,(z) from 
2(D? — 9)Axz) + b(D? — 10)Ax(2) 
+ b(D? — 4)Agz) = 0 (10) 


Solving for A;(z) and A,(z) yields 


2 4) 
A,(z) = (3: 1) — C, cosh az/r + C; (11) 


a’ 
and 
2 (a? — 9) 9C 
A(z) = ( 1) C, cosh az/r — 
7— 10)C 
cosh az/r + C, cosh 22/r (12) 
a — 


In general Am+4:(z) can be evaluated from the recurrence formula 
2 
(D? — m? + m + 2)Ami(z) = — b (D? — m?*)A,,(z) 


— (D* — m* — m + 2)Analz) (13) 


Since Equation (13) is a second-order differential equation for 
Am+i(z), there will always be constants available in each Am+i(z) 
to satisfy the boundary conditions at the ends of the bearing. 
The remaining boundary conditions will now be applied: At the 
end of the bearing (z = 1/2) the pressure function must vanish 


6uUrb[2 + bcos 6 |sin 6 


(2 + b*)[1 + b cos 6)? 


— >> A,,(1/2) sin (14) 
1 


This requires that the first term, P.. (@), in Equation (14) be ex- 
panded in a Fourier series. This can be done by noting the 
P.(0) can be separated into two terms and each term expanded 
jnto a Fourier series 


6yuUr b i 
+ + con 6) * (1+ bcos 


(15) 


sin 6 


It is a well-known fact that + 


can be expanded 


in a Fourier series [5] 
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P=0 
4 
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sin 6 
1 + 2acos0 +a? ‘dno 


which is uniformly convergent for 0 <a <1. If the first term of 
Equation (15) is taken and let 


2 
then 
_bsin 2a sin 6 
1 + bcos 8 1 + 2a cos 0 + a? 


= 2>> sin (16) 
1 


Differentiating Equation (16) with respect to b results in 
re) b sin 6 » bsin 9 
ob 1 + bcos 0 ° (1 + b cos 6)? 


™ sj mA 


Combining Equations (16) and (17) the Fourier series expansion 
of P(@) is obtained 


0) = 
(18) 
where 
1+ (1 b?)' 2 


Substituting this into Equation (14) leads to 


— A,(1/2) sin (19) 
1 
Equating coefficients of sin m@ results in the following 


12uUr( — )" m 
A,(l/2) = 


Now € [from the assumption Ao(z) = —€A;(z)] can be evaluated 
by taking the first two terms of Equation (20) 


12uUra = 
c(2 + b?) + (1 — bt)'72 (21) 


12¢Ura* [1 2 
~ (2 + (1 — 


A,(l/2) = C; cosh a8 = 


Al/2) = cosh a8 = 


(22) 
where 8 = 1/2r 
Dividing one equation by the other gives 
a[2 + (1 — b*)'/*] + (1 — 


It is thus seen that € is evaluated by the boundary condition at the 
end of the bearing. It is readily verified that the pressure func- 
tion [Equation (2)] reduces to Po when z = —1/2. 

In the following section the convergence of the series in Equa- 
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tion (2) is demonstrated. Since Equation (2) satisfies Reynolds 
equation and meets the boundary conditions specified, it is a 
solution of the problem under consideration. It will be noticed 
that the method used to solve Reynolds equation does not re- 
quire the use of any expansion theorem except that connected 
with Fourier series. Thus the method is simple and easy to apply. 
In addition, it will be seen that this method leads to simple equa- 
tions for the journal-bearing characteristics. 


Convergence of Series 


It will now be shown that the series 


A,,(z) sin m0 


converges in the region under consideration. This will be done 
by getting an approximate expression for A,(z) for large m and 
investigating convergence. Making the approximation 


(m — 1)(m + 2) > (m — 1)? 
(m + 1m — 2) > (m + 1)? 
for large m, Equation (4) becomes 
2[D? — m*)A,(z) + b[D? — (m — 
+ b[D? — (m + 1)*JAnu:(z) = 0 (24) 
Let 
[D? — m*]A,(z) = F,, (25) 


where F,, is a function of m, b, and z. Substituting into Equation 
(24) gives the following second-order difference equation with con- 
stant coefficients 


+> bF,, 1 bF = 0 (26) 
It will be verified that a solution of Equation (26) finite at b = 0 


is 


=(-)* (, an) f(z) = (—)a"f(z) (27) 
where f(z) is an even function of z independent of m. Equation 
(25) now becomes 

(D? — = (—)"af(z) (28) 
An even solution of Equation (28) is 


A,,(z) = C,, cosh mz/r 
+ 429 sion = (z — 2z’)dz’ (29) 
mr r 


It is reasonable to assume that f(z) is bounded for 0 < b < 1 and 
z finite. If f(z) is replaced by its maximum value M, the right- 
hand side of Equation (29) becomes 


(—)"a"M 


C,, cosh mz/r — 
m? 


(30) 
The constant C,, can be evaluated from the boundary condition 
at the end of the bearing. Atz = 1/2, 


A,(1/2) = Cy, cosh me — 


M 
+ 
C.= (31) 
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Equation (30) becomes 
A,,(l/2) cosh mz/r (—)"a"M / cosh mz/r 
cosh mB ar ( cosh mB 1) (32) 


Taking the absolute value of the terms in Equation (32) the 
following inequality exists for large m: 


|A,,(#/2) cosh a™M leosh mz/r 


< 


(33) 


|A,,(2)| < |A (34) 


where the inequality has been improved by neglecting some of 
the terms. 

Denoting the right-hand side of Equation (34) by B,, it is 
noted by the Cauchy ratio test that the series formed by B 


T 
is absolutely convergent since a is less than 1 for 0 < b < 1. 


Since |sin m6| < 1 and |A,,(z)| < B,, for large m, it follows from 
the comparison test that the series 


A,,(z) sin m6 


absolutely converges for < z < 1/2,0 < < 2m, and0 <b 


Journal-Bearing Characteristics 


Load Capacity. All of the A,,(z) can be calculated from the re- 
currence formula Equation (13), but this is not necessary when 
determining the characteristics of journal bearings operating with 
the Sommerfeld boundary condition. The load components are 
given by 


-f- P(8, z) sin 8 rdBdz 


(35a) 


"P(6, z) cos rdOdz 


(35b) 


By direct integration, noting the orthogonality of sin m@ and 
cos m8, it will be found that 
W = 0 


that is, ¢ = m/2 and W has the very simple form 


w-W. (: W.K 
aB 


c(2 + b*\(1 


2(1 + eb) 


_ bI2 + (1 — 
+(1— b2)'/2]2 


Notice that as 8 — o, the load capacity goes over to the Sommer- 
feld load capacity for the infinite bearing as it should. 

Coefficient of Friction. The shear stress acting on the journal- 
bearing surface [6] is given by 
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(38) 


where the + refers to the traction on the journal and the — to 
that on the bearing. Substituting h = c(1 + 6 cos @) and inte- 
grating the right-hand side over the journal or bearing surface 
gives the friction force 


f pU 
-1/2 e(1 + b cos 6) 


+ bcos 6) dP 


rd@dz (39) 


Integrating the first term completely and integrating the second 
term by parts and noting the boundary condition P(0, z) = 


P(2n, z) results in 
f P(6, z) sin 6 rdO@dz (40) 
1/2 


ei — — 

Now the integral is simply the component of load perpendicular 
to the line of centers, W sin g. The coefficient of friction is de- 
fined as the ratio of the friction force to the load capacity of the 
bearing. Dividing by the load W, rearranging, noting that g = 
a/2, and using Equation (36) the coefficient of friction is given 
by 


f = 2+? + b/2 


(41) 


A convenient dimensionless quantity often used in journal- 
bearing analysis is the Sommerfeld variable S. This is defined as 


r\? uN 
( c ) Ww’ 
where W’ is the load per unit of projected area and N is the angu- 


lar speed of the journal. Substituting Equation (36) into (42) re- 
sults in 


(42) 


(2 + — 


S= (43) 

End Oil Flow. Reference [6] gives the following expressions 
for the flow in the circumferential and axial directions for unit 
width in a direction perpendicular to the flow 


(44a) 


h? oP 

44b 

de 
The axial flow (end leakage) is obtained by integrating Equation 
(44b) over the circumference of the bearing at the end of the 


bearing. Hence the oil flow out of one end of the bearing is 


] rd0 


Substituting Equation (2) into (45) and carrying out the integra- 
tion leads to 


(45) 


+ 


1248 


(46) 


or in dimensionless form 


m(1 + 3b*/2) 
128 
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(36) 
where 
Q=- f 
and 


The subscript is used to indicate that the flow is due entirely to 
the circumferential groove. 

To insure that the oil film is continuous, the condition is im- 
posed that the axial pressure gradient at the end of the bearing 
be negative [1] 


oz 


where 6,, is the angle at which the tendency for positive gradients 
is the greatest. Equation (48) insures that the velocity of the oil 
is positive (outward) at the end of the bearing. Taking the first 
term of the series and rearranging, a simple equation is arrived at 
that imposes a minimum value for the feed-pressure parameter 


2 
( Ps 48rbaB tanh ag 


uN = (2+ — 
It is clear from Equation (49) that for a given eccentricity and 
length-to-diameter ratio that a continuous oil film depends not 
only on the feed pressure but also on the viscosity, speed of the 
journal, and the ratio of clearance to radius of the journal (all of 
these quantities are related to the load capacity). Note that the 
left-hand side of Equation (49) is similar to the reciprocal of the 
Sommerfeld variable with the feed pressure P, replacing the load 
per projected area W’. 

Temperature and Viscosity of the Oil. A simple method suggested 
by Wilcock and Rosenblatt [7] for estimating the outlet tem- 
perature and viscosity of the oil of a bearing is to equate the 
power dissipated in keeping the journal rotating to the energy 
(heat) picked up by the oil flowing out of the bearing. This gives 
an expression for the viscosity in terms of the outlet temperature 
T: and other parameters of the journal-bearing system. A 
plot of uw versus 7; is then constructed. Superposed on this 
is a plot of mw versus 7’ of the oil used. The intersection of 
these curves gives the value of the outlet viscosity and tempera- 
ture. If the journal bearing is considered as a ‘‘mixer”’ where some 
oil at viscosity yu; goes in end some oil with a viscosity 2 goes out, 
then 2 can be considered as the viscosity of the oil in the bearing. 
This viscosity pu: is used to calculate the power loss of the bear- 
ing and the load of the bearing. 

The power dissipated (H) due to friction is equal to the product 
of the friction force on the journal times the peripheral speed of 
the journal 


H = FU = fwu 
The energy acquired by the oil flowing out of the bearing is 
QyC, AT 


where ¥ is the weight density of the oil, C, is the specific heat, and 
AT is the temperature rise. Equating the two, rearranging, and 
taking into account the difference in units (J), an expression is 
obtained for u in terms of the parameters of the bearing 


2 3b? 
(: + yC, ATS 


(50) 


(51) 


Values for S and () f are obtained from Equations (41) and 


(42) for a fixed value of b. 

The bearing characteristics have been determined. It is seen 
that by modifying the classical method of separation of variables, 
a solution is obtained for Reynolds equation which leads to simple 
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equations for the characteristics of a journal bearing with cireum- 
ferential groove and complete oil film. 


Discussion of Results and Conclusions 


Discussion. Now that an exact solution for the full journal 
bearing with a continuous oil film has been obtained, it is interest- 
ing to examine the results in regard to friction and load capacity. 
In addition, compare results with other published solutions. 

Fig. 3 shows a plot of (r/c)f versus S for a few length-to-diame- 
ter ratios. A Petroff line [(r/c)f = 27S] is also included for eom- 
parison. A salient point of the curves is the ordinate intercept. 
All of the complete-oil-film friction curves for journal intercept the 
(r/c)f axis at 1. Examination of Equation (41) shows that 
all friction curves for the bearing go through the origin when b 
is equal to 1 (S = 0). 


10; 
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Fig. 3 Coefficient of friction (r/c)f versus Sommerfeld variable S$ for 
journal 


Fig. 4 compares leakage factors (W/W...) obtained from the 
exact continuous-oil-film solution with results published by 
Muskat and Morgan [1] for values of b up to 0.6. Muskat and 
Morgan values are based on an approximate solution expanded 
in powers of b valid for less than 0.6. It will be noticed that there 
is good agreement. It is interesting to note that all finite journal 
bearings have load capacities less than that for the infinite bear- 
ing, as may be expected. This is especially true for short bearings 
operating at low eccentricities. But as b approaches 4, all finite- 
bearing load capacities approach the load capacity of the in- 
finite bearing. 

To aid calculations for a specific journal-bearing application, 
values of a for 0 < b < 1 are given in Table 1. 

Conclusions: It has been demonstrated that a simple analytical 
solution of Reynolds three-dimensional equation can be ob- 
tained for journal bearings with circumferential grooves operating 
with a continuous oil film. The method employed makes possible 
the analytical solution of other journal-bearing problems that use 
the Sommerfeld boundary condition. 
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Fig. 4 Relative load-carrying capacity w/ W.. versus eccentricity b are 
compared with results of M. Muskat and F. Morgan 
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Table 1 Values of a versus b 
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Bearing Oil-Ring Performance 


A series of tests was made with rings ranging from 2°/, to 16'/, in. ID and at 


journal speeds up to 4000 fpm. Correlation of the results provides means for calculating 
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01. rings have now been used for the lubrication of 
bearings for over a century. Because of their simplicity, relia- 
bility, and low cost, they have been installed in a great variety of 
rotating machinery over this period. Applied initially in rela- 
tively low-speed equipment, their range of use has continually 
been extended up to present limiting journal surface speeds of 
about 3000 to 4000 fpm. 

Past studies have established many of the details of ring beha- 
vior [1, 2, 3, 4, 5}.1. The rotation of the ring depends on a pro- 
pulsive force between the rotating shaft and the ring. At low 
speeds the static friction between the ring and journal is suffi- 
ciently high for the ring to be driven with its surface speed equal 
to that of the journal. At higher speeds slip occurs and the driv- 
ing force is transmitted through an oil film which is analogous in 
many ways to that in a journal bearing. Modifications of the 
geometry of the inside circumference of the ring have been used in 
some cases to develop a higher friction coefficient and thus a more 
positive drive [3]. Opposing the ring rotation is the drag of the 
lower portion of the ring in the oil reservoir, the force required to 
lift the oil up from the reservoir to the top of the journal, and the 
frictional drag on the ring by any close-running stationary sur- 
faces such as the sides of the ring slot in the bearing. 

1 Numbers in brackets designate References at end of paper. 

Contributed by’ the Lubrication Division of THe American 
Society or MecuHanicaL ENGINEERS and presented at the 
ASLE-ASME Lubrication Conference, New York, N. Y., October 
20-22, 1959. 

Norte: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 


the Society. Manuscript received at ASME Headquarters, July 8, 
1959. ASME Paper No. 59—Lub-5. 


ring speed for any journal speed. Oil viscosity, ring weight and diameter, ring immer- 
ston, and journal diameter are taken into account. 

From this calculated ring speed, oil delivery can then be estimated from equations 
incorporating the effect of oil viscosity and related operating factors. 


Oil-delivery mechanisms have also been considered. The oil 
lifted from the oil reservoir as 4 coating has been observed to be 
delivered to the journal largely from the inside surface of the ring 
by a squeezing action as the ring passes over the rotating journal 
[1, 2}. At low speeds some oil from the sides of the ring was 
found to be available, but this quantity dropped off rapidly as the 
centrifugal force threw out the oil at higher speeds. In the highest 
speed range of operation, the centrifugal force throws off as a 
spray most of the oil lifted from the reservoir [1]. Hersey [6] and 
more recently Ozdas and Ford in a comprehensive study of oil 
rings [5) devised dimensionless correlations of oil delivery in the 
full-film region of ring operation with some of the many factors 
involved. 

These past studies have done much to provide an understand- 
ing of the fundamentals of oil-ring operation. The present analy- 
sis is directed particularly at obtaining quantitative expressions 
for ring speed and for oil delivery as a function of the oil charac- 
teristics, the geometry of the ring and journal, and the bearing 
operating conditions. 


Equipment and Test Procedure 


To study the performance of a number of different oil rings 
without the influence of the bearings which they ordinarily feed, 
the equipment shown in Figs. 1 and 2 was assembled. Fig. 1 is a 
view of the apparatus used for tests on small rings (2.75 to 6.75 in. 
diam), and Fig. 2 shows the arrangement used for large rings 
(6.75 to 16 in. diam). 

In Fig. 1 can be seen a 4.50-in-diam ring running on a 2.25-in. 
journal. Shown are the ring guide and scraper atop the journal, 
the scoop and trough for collecting the oil brought up by the ring 


Nomenclature 


= journal diameter, in. 

= inside diameter of oil ring, in. 

= coefficient of friction 

= acceleration of gravity, 386 in/sec? 

= ring immersion, depth of ring inner diameter beneath 
the static oil level, in. 

k = constant 


0.87 
full-film speed characteristic, (DV /v) 


Da 


> 


no-slip characteristic, u4V,D/W 

journal rotational speed, rps 

ring rotational speed, rps 

length of periphery of oil-ring cross section from which 
oil is delivered, in. (equals inner width for unscraped 
ring) 
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= 


Q = oil delivery, in.*/sec 
Q, = oil delivery at delivery transition speed V,, in.*/sec 
v = journal surface velocity, ips 
V = ring i.d. surface velocity, ips 
vo, Vo = minimum surface velocity of journal and ring for full- 
film ring operation, ips 
v,, V, = maximum surface velocity of journal and ring for no- 
slip operation, ips 
v, V, = ring oil delivery transition speeds, ips 
W = weight of oil ring, lb 
= viscosity, Reyns, lb sec/in.? (equals centipoises 
multiplied by 1.45 X 10-7) 
kinematic viscosity, in.*/sec (equals centistokes mul- 
tiplied by 1.55 & 10~*) 
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from the reservoir, the thermometer for measuring the tempera- 
ture of the oil, the oil-supply pipe, and the oil tail on the ring 
which always accompanies operation of oil rings. The location of 
the weir for controlling the level of the oil in the reservoir is indi- 
cated, but the weir itself is not visible. An oil-supply pump and 
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sump tank were also used for controlling the oil level and tempera- 
ture. 

In Fig. 2 a 16-in. oil ring is running on a 10-in. journal in the 
large ring apparatus. Visible are the oil collector, the splash 
cover, the reservoir, and the scissors jack for adjusting the oil 
reservoir and level to the desired submersion on the ring. 

The shapes and dimensions of the test rings are indicated in 
Fig. 3. Numerical dimensions of the rings are given in Table 1. 
With the exception of the specially machined laminated-phenolic 
ring, all the rings were taken from production stock for electric 
motors. The trapezoidal-section zinc rings were die castings, and 
the T-section brass rings were made from rolled stock riveted 
together as shown in Fig. 3. The diameters of the steel journals 
run with each ring are also given in Table 1. 

Table 2 summarizes the physical properties of the lubricant used 
in the studies, a medium turbine oil. 

After the chosen journal-and-ring combination had been 
mounted in the apparatus with the oil collectors and ring guides 
in place, the oil temperature was stabilized and the ring-submer- 
sion level was adjusted. Tests were usually started at the lowest 
journal speed, 40 rpm. The oil issuing from the collector was di- 
rected into a graduated cylinder for an integral number of min- 
utes to determine oil-flow rate, and the rpm of the ring was deter- 
mined by counting the passages of a white mark on the ring. 
The speed of the journal was measured by a stroboscope. The 
journal speed was then increased stepwise by means of a variable 
drive, with a repetition of the oil-collecting and speed-measuring 
procedure, to 1800 or 3600 rpm or until the oil tail of the ring ex- 
ceeded the boundaries of the reservoir. A few tests on small rings 
were run to 7000 journal rpm. 

In the tests on small rings, three immersion levels were studied 
for each ring-and-journal combination. Tests were run at three 
temperatures for one immersion level for each combination. 
Tests at the lowest temperature, 100 F, were run first; then the 
oil would be heated next to 140 F and then 170 F for the subse- 
quent series of tests. The maximum attainable temperature in 
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; >. on Fig. 1 Small ring apparatus 
Fig. 2 Large ring apparatus 


Table 1 


Height, 


Inner diam, Inner 


ng 
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All dimensions in inches. See Fig. 3 for symbols used. 


(a) 
TRAPEZOIDAL 
SECTION 
RING 


(b) 
T- SECTION 
OIL RING 


RING SPEED, RPM 


500 1000 


JOURNAL SPEED, RPM 
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1/4" IMMERSION | 
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4---4 3/4 
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1000 
JOURNAL SPEED,RPM 


Fig. 4 Effect of depth of immersion (4 1 /2-in. diam zinc trapezoidal sec- 
tion ring, 2 1/4-in. diam journal, 140-F ofl temp) 
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1500 


Outer width, Thickness, 
a t 


Ring dimensions and combinations 


Section 
radius, _ Journal 
r 


lb diameters, d 


Material 
Zine 
Zine 
Phenolic 


Table 2 Oil properties 
Viscosity 
—— Kinematic——~ Absolute, 
centistokes _in.*/sec Ib sec/in.* 
0.114 9.30 X 
0.0431 3.45 X 10-8 
0.0294 2.34 X 
0.0248 1.96 


Density, 
gm/ce 
0.870 
0.855 
0.849 
0.843 


the large ring tests was 160 F because of the greater volume of oil 
circulating through the system. 


Factors Affecting Oil Delivery 


One of the requisites for satisfactory operation of a fluid-film 
bearing is an adequate lubricant supply. For the oil-ring bearing 
a vital question is always “Will the ring deliver enough oil?’’ 
How much is “enough’’ may be estimated by recently developed 
methods of hydrodynamic analysis [7, 8]. The amount of oil de- 
livered by a ring has been observed for a wide variety of condi- 
tions in this study. Several specific factors influencing delivery 
will be reviewed first; then a generalized correlation of these ob- 
servations will be considered in later sections. 

Depth of immersion. Fig. 4 illustrates the decrease in ring speed 
and decrease in oil delivery invariably found here and in previou ; 
studies [1, 3, 5] to accompany an increase in submersion. So long 
as the ring is sufficiently below the level of the oil in the reservoir 
that the inner surface is never exposed at any speed, oil delivery 
is not drastically changed with reasonable changes in oil level. 
For the variation in level experienced in bearings in actual ser- 
vice, i.e., from a minimum immersion of about 10 per cent of ring 
diameter to the maximum immersion of about 20 per cent, the 
change in oil delivery found in these tests was always less than 20 
per cent and generally about 10 per cent or less. 

Oil Viscosity. The effect of changes in oil viscosity on ring 
speed and oil delivery has also been extensively studied [1, 2, 3, 4, 
5]. As the temperature of the oil increases, its viscosity de- 
creases, and the quantity of lubricant delivered by the ring 
decreases accordingly. Data illustrating this phenomenon are 
plotted in Fig. 5. The viscosity of the oil affects the speed of the 
ring to a much lesser extent than it affects the oil delivery. This 
will be discussed in more detail in subsequent sections where em- 
pirical equations relating oil delivery and ring speed to viscosity 
are developed. 

Guides and Scrapers. Perhaps the most striking observation 
made was the severalfold increase in oil delivery with the use of 
guides and scrapers to remove the oil from the sides and top of the 
ring and to keep the ring centered in the ring slot of a bearing. 
In separate tests on a complete bearing assembly, a better than 
50 per cent increase in ring speed was realized when a bent paper 
clip was inserted between the ring and the sides of the ring slot, 
eliminating the long viscous drag length normally encountered 
whenever the ring touched either side. The use of such a guide 
more than doubled the oil made available to the bearing, first 
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because of higher ring speed and second because of the scraping 
action of the guide on the sides of the oil ring. Although a scraper 
slightly reduces the speed of a free-running ring as shown in Fig. 
6, the oil delivery increases by a factor of three or four. Correla- 
tion of the data shows the amount of oil delivered to be a direct 
function of the length of the ring cross-section periphery from 
which the oil has been scraped or squeezed. The clearance be- 
tween ring and scraper should be as small as possible and still 
permit unimpaired operation of the ring at all speeds. Theo- 
retically the clearance may be optimized for maximum delivery, 
but for the rings studied this optimum was less than the 0.010 
to 0.015-in. clearance required for unimpaired operation. 

Material. Another variable affecting the operation of the ring 
is the material from which it is made. Fig. 7 gives com- 
parative ring speeds and oil deliveries for a die-cast zine ring and 
for a ring machined from a laminated phenolic plastic. Both 
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Fig. 5 Effect of temperature (5 5/8-in. diam trapezoidal section zinc 
ring, 2 1/4-in. diam journal, 7/8-in. immersion) 
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Fig. 6 Effect of scrapers (5 5/8-in. diam trapezoidal section zinc ring, 
2 1/4-in. diam journal, 7/8-in. immersion, 100-F oil temp) 
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the ring speed and oil delivery for the plastic ring are less than 
those for the zinc ring. Also the equilibrium position of the light 
ring shifts to such an extent that the ring could rub on the bearing 
liner or on the housing in an actual installation. All past investi- 
gators agree that lightweight rings do not operate satisfactorily. 
One comments [1], “Light rings, sometimes advocated by de- 
signers, have no justification.’’ 

Out-of-Roundness. Design tolerances on out-of-roundness are 
generally quite stringent. Surprisingly, however, the oil deliv- 
ered and the ring speed for free-running rings were only slightly 
affected by a considerable amount of ellipticity as shown in Fig. 
8. A 16.5-in-diam ring 1.0 in. out-of-round (an ellipse with a 
major axis of 17.0 in. and a minor axis of 16.0 in.) ex- 
hibited ring speeds and oil deliveries 80 per cent of those for a ring 
only 0.012 in. out-of-round. Out-of-roundness up to 2 per cent of 
the ring diameter has no appreciable effect on ring speed or oil 
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Fig. 7 Effect of ring material (4 1/2-in. diam trapezoidal section zinc 
ring, 2 1/4-in. diam journal, 1/2-in. immersion, 100-F oil temp) 


100 


RING SPEED ,RPM 


JOURNAL SPEED, RPM 


| 
/ 
a 1000" S07 
© 02 000" OuT-oF. 
| | 
| 
| 
0 200 400 600 800 


JOURNAL SPEED, RPM 


Fig. 8 Effect of ring out-of-roundness (16 1/2-in. diam T section brass 
ting, 10-in. diam journal, 3 1/2-in. immersion, 100-F oil temp) 
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delivery, but the larger ellipticities do make the rings swing in a 
manner which could impair their operation in a bearing. The 
out-of-roundness tolerances on rings should be established by the 
limit of this oscillation which can be tolerated in the bearing 
housing. 


Oil-Delivery Ring-Speed Relationship 


The most important factor affecting the oil delivery of an oil 
ring is the speed of the ring itself. As the speed of the ring is in- 
creased, the quantity of oil delivered increases—up to a point. 
When the velocity of the ring becomes too large, centrifugal force 
flings the oil off the ring and it is lost for lubrication purposes. 
Below this velocity, however, the relationships between oil de- 
livery and ring speed are remarkably consistent. 

Low Ring Speed. In Fig. 9 the oil delivered per length of ring 
cross-section periphery from which the oil is scraped or squeezed is 
plotted against ring velocity for a variety of rings with diameters 
from 2.75 to 16.50 in. with trapezoidal and T cross sections, 
scraped and unscraped, and with various immersion levels. 
With the exception of a few renegade points, the correlation is 
surprisingly good below the transition speed. The individual 
points plotted in Fig. 9 and the solid line are for an oil temperature 
of 100 F. Also drawn on the plot are broken lines representing 
the results of tests at 140 and 170-F oil temperature. The slope of 
all these lines on the log-log plot is the same, namely, 1.5. The 
intercepts are a function of temperature as it affects the lubricant 
viscosity. From Fig. 9 the following empirical expression is ob- 
tained for specific oil delivery as a function of ring velocity V and 
oil viscosity v 


Q = 


Q = DN)" (1) 


where Q is the cubic inches of oil delivered per second. For an 
unscraped ring, p becomes simply the width of the ring riding on 
the journal. Although this equation holds for essentially all of 
the tests made and is not influenced by ring immersion, several 
factors are not included which exerted an influence. At the same 
ring speed a lighter weight phenolic ring gave lower oil delivery 
than a zine ring. Heavier ring weight apparently gave a thinner 
oil film between the ring and the shaft with more oil consequently 
being squeezed out from under the ring for delivery to a bearing. 
Surface tension of the oil probably plays a role also, but it was not 
evaluated here. such variables seems 
necessary to give a complete correlation of factors in Eq. (1) on a 
dimensionless basis. 

Transition Speed. A transition ring speed, designated as V, in 
Table 3, exists above which oil delivery no longer increases in the 
manner described in Eq. (1). 


Introduction of some 


A likely cause of this transition in 
delivery is the increased centrifugal action which reduces availa- 
bility to the journal of oil exuded from between the ring and 
journal [2]. A dimensionless centrifugal-force term V is 
shown to vary with a dimensionless term containing viscosity in 
Fig. 10 for the transition ring speed data of Table 3 for free-run- 
ning (unscraped) rings as follows 


gD 


Substituting the numerical value of g = 386 in./sec.* 


(2) 


Dis 
V, = 0.20 —-; N, = 0.063 — (3) 
y-67 


This value of V, is strongly influenced by the geometry of the 
system. The use of scrapers, for instance, approximately tripled 
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Fig. 9 Variation in oil delivery with ring speed (experimental points 
for 100 F only with 140 and 170 F trend lines given for comparison) 
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Fig. 10 Ojil-delivery transition speed correlation 


the values of V, in Table 3 for the smaller ring sizes listed. A 
scraper still effectively removed the oil from the sides and outer 
ring surfaces in the volume given by Eq. (1) up to these much 
higher ring speeds. Eqs. (2) and (3) reflect the over-all influence 
of a number of factors since they are empirically obtained from 
experimental results. Sufficient data have not yet been obtained 
to separate the individual effect of each variable. 

High Ring Speed. Above this transition speed, oil delivery in- 
creases much less rapidly with increasing ring speed. Surpris- 
ingly, delivery drops in this region for increasing journal speed. 
This was observed in a number of tests where the ring speed re- 
mained essentially constant over a wide range of journal speeds in 
the partial-film drive range of ring operation. Delivery varies as 
the square-root of the oil viscosity. The trends in the various 
factors involved were reasonably well related in most cases by a 
dimensionless oil-delivery term and a journal Reynolds number 
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Table 3. Transition speeds in oil-ring operation 


Delivery 
Oil transition No-slip No-slip —Start of full-film drive— 
Ring diam, Journaldiam, Immersion, visc., speed, limit, charact., 
D, in. d, in. I, in. microreyns ips wv, = V,,ips k, = »pV.D/W ips ips 
Unscraped rings 

4.50 2.25 0.25 9.30 13 6 0.0019 5 17 2.8 
0.50 9.30 11 5 0.0016 4 13 2.6 

0.75 9.30 12 5 0.0016 4 14 2.8 

5.625 3.00 0.875 1.96 17 23 0.0010 21 70 3.0 
5.625¢ 3.00 0.875 3.45 31 24 0.0011 19 6 40 
1.96 36 34 0 Zt 29 140 4.1 

6.75 3.00 1.00 9.30 22 16 0.0015 9 39 2.4 
2.34 25 33 0.0008 18 100 3.3 

10.002 6.00 1.00 9.30 28 38 0.0020 27 110 2.9 
2.34 69 67 0.0009 

16.50 10.00 3.50 9.30 52 72 0.0022 49 240 3.3 
2.34 121 340 0.0026 620+ 

17° 10.00 2.25 17.4 ves 16 0.0011 11 42 2.6 
5.08 115 0.0024 44 230 20 

27° 16.00 2.50 17.4 56 0 0016 44 142 2.5 
5.08 126 0.0011 51 300 2.4 

Seraped rings 

4.50 2.25 0.50 9.30 32 4 0.0013 3 10 2.5 
3.45 33 11 0.0013 10 35 3.2 

1.96 38 26 0.0017 15 54 2.1 

5.625 2.25 0.875 1.96 48+ 33 0.0015 22 100 3.0 
Average 0.0015 2.9 


* T-section ring. 
> From Reference [1]. 


for which the general form was suggested by Hersey [6]. 


Q (: 0.8 
ppv * 4 
By applying this equation twice, the delivery Q for any ring 


speed V and shaft speed v in this high-speed range can be related 
to the transition speed as follows 


For such a calculation V, is obtained from Eq. (3), with a corre- 
sponding v, from Eq. (9) as discussed in the next section, and with 
Q, from Eq. (1). For operation in the full-film region where Eq. 
(9) holds, Eq. (5) becomes simply 


At the top of this high-speed range the throw-off of oil from both 
the ring and the journal becomes so vigorous that the oil is no 
longer available for use in a bearing. Frequent and sometimes 
sizable deviations were encountered for the oil-delivery relations 
suggested for this high-ring-speed range, and further study of the 
factors involved would be desirable. Further analysis of the oil- 
delivery transition speed should be particularly fruitful. 


(4) 


(5) 


(6) 


Ring-Speed Journal-Speed Relationship 


The variation in ring speed observed with variations in journal 
speed almost always shows three distinct regions such as can be 
seen in a number of the plots in Figs. 4 to 8. At low journal 
speeds the velocity of the inside circumference of the ring is iden- 
tical to that of the OD of the journal. At some point during an 
increase of journal speed through this no-slip region, the static 
friction between the ring and journal is no longer sufficient to 
maintain synchronous ring speed. Beyond this journal speed a 
region of slower ring rotation is encountered in which the drive of 
the ring is partially through metallic contact and partially by a 
fluid film. As the journal speed goes still higher, a full-film region 
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is finally reached in which the ring is driven entirely by hydro- 
dynamic action through an oil film. 

Tabulated in Table 3 are the transition points which were ob- 
served as the ring speed passed from one zone of operation to 
another with increasing journal speed. In most cases these zones 
of operation and their transition points were quite well defined. 
With smaller rings, however, the region of nonslip operation 
terminated at such low journal speeds that it was not always 
evaluated. On the other hand, with the 16'/,-in. ring at 160 F 
the region of full-film drive was not entered throughout the test 
range which extended up to about 600 ips (3000 fpm). 

No-Slip Operation. The ring speed in no-slip operation is quite 
well defined. The ring operates synchronously with the journal 
and the surface speeds of the two in contact are equal. 

The upper speed limit for no-slip operation can be expressed 
through a balance of the static driving friction with the drag 
forces acting on the ring. The coefficient of friction between the 
journal and the ring multiplied by the ring weight will give the 
driving force on the ring which can then be visualized as being 
exactly equal at the no-slip limit to viscous-drag forces in the fol- 
lowing relation 


JW =kyuV,D 
Ww 


(7) 
The slip characteristic k, averages 0.0015 in Table 3 and is rela- 
tively constant with either the trapezoidal-cross-section rings or 
the T-section rings. From the above equation the top ring ve- 
locity for no-slip operation is seen to increase proportionately 
with ring weight and to decrease with increasing viscosity of the 
oil and diameter of the ring. 

Despite the relative constancy of the slip characteristic k, in 
these tests, past investigations have shown that the limit of opera- 
tion without slip is also influenced by other factors. V, can be 
increased severalfold by such modifications as grooving the inter- 
nal surface of the ring [3] or by increasing roughness of the inside 
surface of the ring [5]. The slip speed also varies with the ring 
immersion and is higher the less the amount of drag of the ring 
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Fig. 11 Reynolds number relationship for oil ring drive 

in the oil reservoir. Scraping oil from the sides and top of the 
ring had relatively little influence on the limit of no-slip running 
for the tests listed in Table 3. 

Partial-Film Drive. Once the ring has started to slip with in- 
creasing journal speed, a decreased driving force is provided by 
the partial oil film generated between the ring and the journal. 
As can be seen by a comparison of V, and Vo in Table 3, the ring 
speed drops off very little with smaller rings in this region of a 
partial-film drive. With large rings, however, the speed decrease 
is quite marked. This is apparently true with large rings because 
of the great difference in speed between the limit in no-slip opera- 
tion V, and the full-film ring operating speed which would exist 
at the same journal speed. 

As the degree of slipping between the ring and the journal sur- 
face increases with the increasing journal speed, a point is finally 
reached at which a complete lubricating-oil film is built up to sup- 
port the ring. This change to full-film operation was given for 
the tests in Table 3 by the average relationship 


= 2.90, 


(8) 


The ring-speed journal-speed relationship in this partial-film 
operating region was reasonably represented for most of the tests 
by simply connecting the points v,, V, and m, Ve by a straight 
line in a logarithmic plot of ring speed versus shaft speed. 

Full-Film Drive. When this region is entered, the ring speed again 
begins to increase with increasing journal speed. In the experi- 
mental observations, the ring speed generally increased with 
higher oil viscosity. Ring speed was also a factor of the diame- 
ter of both the journal and the ring. These various factors seemed 
satisfactorily related by comparing a Reynolds number for the 
driving action of the shaft with the Reynolds number for the ring 
in Fig. 11, which Jeads to the following relationship 


(nd?)p-% 


V = 2.10 D (dvp-™; N = 1.67 


(9) 
This equation is based on an immersion of the ring in the 
oil in the reservoir corresponding to 15 per cent of the ring di- 


ameter. For different ring immersions, the ring speed varies in- 
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versely on an average as the 0.20 power of the ring i: mersion. 
This results in the following ring-speed characteristics for various 
ring immersions. 


Fractional 
ring immersion, //D 
0.05 
0.10 
0.15 
0.20 
0.25 


Ring-speed 
ky 
2.62 
2.28 
2.10 
1.98 
1.90 


Several other factors also influence ring speed. With scrapers 
present to take oil from the sides and top of the ring, the ring 
speed drops approximately 20 per cent. Rubbing against the 
sides of the ring slot in an actual ring-oiled bearing, the drag 
caused by a keeper placed over the top of the ring slot to keep the 
ring in place, and the influence of other adjacent parts in a bear- 
ing housing all serve to reduce the oil ring speed. The speed of a 
ring in a complete bearing assembly may drop anywhere from 5 
or 10 per cent up to 50 per cent or more below the speed of a free- 
running ring. 


Summary 


The following relationships summarize the performance char- 
acteristics of oil rings observed in this study. The ring-speed 
shaft-speed relationships, listed for increasing journal-speed 
ranges, are as follows: 

Range Ring-speed equations 
No-slip drive V =v; N = nd/D 
Start of slip V, = 0.0015 W/uD; N = 0.00048 W/uD? 
Partial-film drive (See text) 
Start of full-film 
operation 


Full-film drive 


te = 2.90, 


V = 2.10 (dv p-*; 


N = 167% 


(nd2)p-8 

Once the ring speed is determined from these relations for a 
given operating condition, use can be made of the following oil- 
delivery equations. 


Range 
Low ring speed 
Transition speed V, = 0.20 


Oil-delivery equations 
Q = Q = 
Des 
High ring speed Q = Q 


Some idea of the utility of these relations becomes apparent 
when they are applied to data from two previous investigations 
in Table 4, although the equations given were used with their 
units of lb, in., and sec, the comparison is provided in terms 
of the common rpm and gpm units. The ring speed and 
oil delivery were calculated from simply the shaft speed and re- 
lated factors by the procedure already outlined. 

For the 27-in. ring data of Karelitz [1], the oil-delivery transi- 
tion ring speed was calculated from the appropriate equation 
to be 55 rpm. Below this speed, for 200 and 400 journal rpm, 
oil delivery was calculated from the calculated ring speed to- 
gether with the oil-delivery equation for low ring speed. The 
last two cases from Karelitz are for journal speeds up to 3350 fpm, 
approaching the highest speeds normally encountered in ring- 
oiled bearing operation. For these the calculated oil delivery was 
obtained from the high-ring-speed oil-delivery equation. 

A similar procedure was used for the 6.3-in. ring data of Ozdas 
and Ford [5]. Here, however, the oil-delivery transition speed 
was approximately 130 ring rpm as judged from the reported 
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Table 4 Comparison with data from past investigations 
Journal Oil 
diam, Immersion, vise, 

i i centistokes 


— Oil delivery, ——~ 


Cale. 


Journal 
rpm 


in. in. in. 
From Karelitz {1} 
27 16 


2.5 200 
400 


From Ozdas and Ford [5] 
6.3 3.94 


data, while the calculated value was only 66 rpm. The higher 
transition speed obtained from the experimental data was used 
together with the high-speed oil-delivery equation for the last two 
oil-delivery calculations in Table 4. 

The ring-speed relationships held well in Table 4. Oil delivery 
is of the right order of magnitude, but improvement in the means 
for estimating the oil-delivery transition speed seems desirable. 
Where a higher order of accuracy is required, a few test observa- 
tions with an actual bearing installation will establish more closely 
the numerical constants in the equations given for the system 
under consideration. 
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Bearings 


Torque Produced by Misalignment of 
Hydrodynamic Gas-Lubricated Journal 


A perturbation analysis is used to find the torque generated by a misaligned journal 
bearing (or, equivalently, to find the angular displacement of a torque-loaded journal). Re- 
sults are presented graphically in terms of nondimensional parameters. 


Introduction 


I. SOME APPLICATIONS of gas-lubricated jour- 
nal bearings, the misalignment torque or angular stiffness of 
the bearing is of interest. For example, a very high angular stiff- 
ness is required for gyroscope rotor bearings in order to accurately 
define the spin reference axis direction. Misalignment torque 
is also important in the design of self-aligning journal bearings. 
Here one is interested in determining how much friction or tor- 
sional stiffness can be tolerated in the pivoting mechanism. 

An approximate analytical solution for the misalignment 
torque of hydrodynamic, gas-lubricated journal bearings is given 
in this paper. The general method follows the perturbation 
analysis employed in earlier papers {1 and 2].! Resulting charac- 
teristic curves are presented in Figs. 3 and 4. A numerical 
example gives a misalignment torque of 0.16 lb-in. per sec of arc 


‘Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of THe AMERICAN 
Society or MecHanicaL ENGINEERS and presented at the ASLE- 
ASME Lubrication Conference, New York, N. Y., October 20-22, 
1959. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, February 
3, 1959. Paper No. 59—Lub-3. 


Nomenclature 


misalignment. This compares favorably with similar-sized pre- 
loaded ball bearings for gyroscope rotors. 


Statement of Problem 


The problem is to calculate the pressure distribution and torque 
resulting from an angular misalignment of a gas-lubricated jour- 
nal bearing. Fig. 1 shows a journal of radius r rotating at an 
angular velocity w within a stationary bearing of radius r + c and 
width (or length) b. The axes of the journal and bearing intersect 
at an angle 6 midway between the two ends of the bearing. In 
other words, the journal is not translated relative to the bearing 
but is merely rotated or misaligned. 

The bearing is operating in an atmosphere of gas at ambient 
pressure po. As the journal rotates it drags this gas through the 
gap h between the bearing and the journal, and sets up hydrody- 
namic (more properly aerodynamic or fluid dynamic) pressure 
forces on the journal and its bearing. For equilibrium, the inte- 
gral of the pressure forces must be equal and opposite to the total 
load. In the case under consideration, the load is a torque of 
moment M. In general, this torque M will be applied at some 
angle y to the axis of the sngular deflection of the journal and 
will have components M, about this axis of angular deflection 
and M; about an axis perpendicular to the angular deflection. 

Since the radius of curvature r is much greater than the gap 
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local thickness of lubricating gas 
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width A the action of the air film between the two bearing sur- 
faces can be described by slider bearing theory. The Reynolds 
equation for slider bearings with lubricant density proportional 
to p'* and constant viscosity is [1] 


where p is the pressure of the lubricant, U is the velocity of the 
moving surface in the z-direction, 9 is the absolute viscosity of 
the gas lubricant, and k is its coefficient of polytropic expansion. 

Theoretically, it is most reasonable to assume isothermal lubrica- 
tion (k = 1), but practically it has been found that k = y, where 
7 is the ratio of specific heats, often gives better agreement with 
experimental data [2] on journal bearings. 

Equation (1) is more conveniently applied to journal bearings 
by the substitutions 02 = rd¢, y = rf, and U = rw. With these 
substitutions and division by po'/*c*, Equation (1) becomes 


where 


Perturbation Equations 

A first-order perturbation solution to Equation (2) will be ob- 
tained following the same treatment employed previously by the 
author [2]. In brief, the gap width is very closely approximated 
by 


h=c+rficoso (3) 


and the pressure is assumed to be expandable in a power series of 
increasing order in the misalignment angle 6. 
= pot dn + + +... (4) 


If Equation (3) and (4) are substituted into Equation (2), and 
terms are grouped according to powers of 5 the following series of 
linear partial differential equation results: 


_ Hkporf 
of? c 


sin 


(5) 
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Sketch of misaligned journal bearing 


H ad + at? sin 

casing 2H cos >) Ot cos 


1 dp: dp: \* 
im + © 

The first equation can be solved for p; which can then be sub- 
stituted into the right-hand side of Equation (6) permitting (6) to 
be solved for p:. In theory, this procedure can be carried out to 
obtain higher and higher order terms in the pressure series. Prac- 
tically, however, the equations rapidly become so complicated 
so as to force termination of the solutions after only one or two 
terms. 

Mathematically, it is very difficult if not impossible to prove 
convergence of a perturbation technique such as this. However, 
the same method has been employed in other problems of this 
type, and it has been shown to give good approximations to exact 
numerical solutions in the case of infinitely long journal bearings 
[1] and to experimental results in the case of finite length journal 
bearings [2]. As long as the results describe the physical process 
to engineering accuracy, the question of convergence is academic. 

At each step in the solution the pressure is required to satisfy 
the following boundary conditions: 


1 The pressure must be periodic in $; that is, 
= + 27) 


2 From dynamic and geometric symmetry, the pressure for 
positive ¢ must equal the pressure 180 deg around the bearing at 
negative ¢; that is, 


3 The pressure around the two edges of the bearing must equal 
the ambient pressure; that is, 


+5) = Po 


Perturbation Solution 
Let us try a solution for Equation (5) of the form 
pi = Aig sin + Axf cos d + Z(f)- (7) 
where A, and A; are constants, Z is a function only of {, and ® is 
a function only of @. The form of Equation (7) is suggested by: 
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(a) The symmetry requirement for p,; (b) the periodicity require- 
ment; and (c) the sinusoidal term on the right-hand side of Equa- 
tion (5). When Equation (7) is substituted into Equation (5), we 
obtain 


— HZ®’ + = (4. ~ ae 
+ (HA; + cos@ (8) 


In order to obtain a homogeneous, separable equation in Z and ®, 
we choose A; and A; such that the right-hand side of Equation (8) 
is identically zero. With this choice for A, and As, Equation (7) 
becomes 


Hkpor 


Pi 


and Equation (8) separates into the two ordinary differential 
equations 


— Kb =0 (10) 


— KZ =0 (11) 


where K is the separation constant. 
Solutions to Equation (10) and (11) which satisfy the first two 
boundary conditions (periodicity and symmetry ) are 
= Be'* = (B, + iBs)cos + i sin >) (12) 


Bind 


and Z = Ce+ VX! = 2C [sinh (at) cos (Bf) 
+ icosh (af) sin (8F)} (13) 
2 2 
K =1+iH, and i=+/-i 


Multiplying Equation (12) and (13) together and retaining either 
the real part or the imaginary part of the product, we obtain 


Z® = E, [sinh af cos Bf cos @ — cosh af sin BF sin d] 
+ E; [cosh af sin Bf cos ¢ + sinh af cos Bf sin ) (14) 
The constants EZ, and E; are fixed by the third boundary condition 


where 


m= = +7). 
(9)] 


This condition is met if [see Equation 


(H cos @ — sin 


(15) 
when 


By setting £ = +b/d in Equation (14) and equating the coeffi- 
cients of sin dand of cos @ with those in Equation (15), we can solve 
for E, and E;. With the resulting values of Z, and E:, Equation 
(14) is then substituted into Equation (9) to obtain the first 
order perturbation pressure as a function of [ and @. 


+ 


sinh (af) cos (Bf) 


cosh (af) sin (Bf) | sin 


cosh (at) sin (BE) H 


Two pressure distributions calculated from Equation (16) are 
illustrated in Fig. 2. 

According to the perturbation technique outlined earlier, we 
should now take p; as given by Equation (16), substitute it into 
the right-hand side of Equation (6) and solve for p:. The full 
solution for p: is quite lengthy and for our purpose it is sufficient 
to realize that the form of p: must be 
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sinh (af) cos (BS) 


cos } 


(18) 


= Silt) + ff) sin 2 + cos 


where f,, fz, and f; are functions of £. The important thing to 
observe is that p: contains no first harmonic terms in ¢@. As we 
shall see shortly, this means that p; contributes no support to 
the bearing torque load. 


(17) 
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Bearing Misalignment Torque 


The final step is to determine the journal restoring torque or 
moment M resulting from an angular displacement 6 of the jour- 
nal. This is also equal in magnitude but opposite in sign to the 
torque which when applied to the bearing would produce an angu- 
lar misalignment 6 of the journal relative to the bearing. The 
torque components can be computed from the pressure distribu- 
tion by 


~ 
M, = -rf vdy f, p cos ¢ do (18) 
+3 
M, = rf vdy f, p sin d dd (19) 


9 


where M, is the moment component acting about the axis of angu- 
lar deflection of the journal, and M, is the moment component 
acting about an axis perpendicular to the angular deflection axis. 
It is clear from the form of the d@ integrals in Equation (18) and 
(19) that only first harmonic pressure terms (sin @ and cos ¢) 
will contribute to the load. The first-order pressure perturba- 
tion p; contains first harmonics in @, but the second-order pertur- 
bation p: does not. In general, all the odd-order perturbations 
Pi, Pa, Ps, etc., will contain first harmonic terms, the sum of which 
will determine the torque load on the journal bearing. Assuming 
that the predominant first harmonic terms are contained in p,, 
we can determine the approximate torque load from 


b 
+ 
M, = f cos dd (20) 


Mz = 


Substituting Equation (16) into (20) and (21), we obtain the 
approximate torque components along and perpendicular to the 
axis of angular deflection. 


ae 


b 
d 


338 / suNE 1960 


y IN INCHES 
Fig. 2 Axial pressure distributions (first-order) on a misaligned gas-lubricated journal 
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Figs. 3 and 4 describe the total moment vector as calculated 
from Equations (22) and (23) in terms of its magnitude, M = 


+ M.?, and its direction, = tan~! M:/M,, relative to the 
axis of angular deflection of the journal. 


Discussion of Results 


Unfortunately, there is no published experimental data on re- 
storing torques of misaligned gas-lubricated bearings with which 
to compare the theoretical results just obtained. On the basis 
of previous work on radially loaded journal bearing [1 and 2], the 
author expects the first-order perturbation solutions to provide 
adequate engineering design data for eccentricity ratios up to 
one half (€ < 1/2). In the present problem the eccentricity ratio 
increases linearly from zero at the center of the bearing to a 
maximum of 6b/2c at either end. 

The same criterion (total € < !/:) may also serve as a guide for 
the region over which the torque-loaded solution obtained herein 
may be superimposed on the radially loaded solution given in [2]. 
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Fig. 3 Theoretical angular stiffness characteristics (first-order) of misaligned gas-lubricated journal 
bearings 
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Fig. 4 Theoretical angle (first-order) between applied torque and axis of angular deflection 
of the journal in a gas-lubricated journal bearing 


In the region of low eccentricity ratios, the solutions to the linear- 
ized (first-order) equation are fairly good. Since the equation is 
linear, a linear sum of the two solutions (radial-load and torque- 
load) is itself a solution. 
The torque components, M, and M;, were determined by in- 
tegrating the gas film pressure distribution. It was tacitly as- 
sumed that viscous torques acting about axes perpendicular to the 
spin axis were negligible in comparison to the pressure derived 
torques. Actually, differential viscous moments must arise (Fig. 
5) because viscous shear stresses are inversely proportional to -+4> 
the local gap width which is asymmetric when the bearing is | MEE MOMENT DUE $e HeVEREMrA. 
The magnitude of these differential viscous stresses relative to F#8- 5 Development of viscous torques perpendicular to the journal 
the differential pressure stresses can be estimated as follows: ie a 


‘ nor Since Hk/6 is generally of the order of 1 while c/r< 1, the viscous 
Differential shear stress ™ As " = Hk ¢ torques perpendicular to the spin axis may be neglected in com- f 
Differential pressure stress Ap €Po 6 Fr parison to the pressure derived torques. ‘fie 
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Numerical Example 


Find the angular stiffness of a gas-lubricated journal bearing 
operating at 12,000 rpm in air at 100 F and 1 atmosphere pres- 
sure. The bearing diameter is '/; in., its length is 1'/; in., and the 
average radial clearance between journal and bearing is 10~¢ in. 


60 


6x (28 of time x 


X 


positive pressure loop and seem in fact to approach an asymp- 


totic value. There are no published data available for bearings 
running at elevated ambient pressures and perhaps the author 
knows of such experiments in which the discrepancy is not as 
high as that given. However, it remains fairly certain that this 
behavior is typical and that the boundary conditions in gas 


1 
me x (4) 
= 6.4 


kpoc? 


From Fig. 3 for H = 6.4 and b/d = 3 


12c-M 
mkp.rb% 
(0.733)" XK 1.4 X 14.7 x X (1.5)? in. 
or — = 
12 X 10-* in. XK 2.06 X 10 sec of arc/radian 
sec of arc 
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DISCUSSION 


0. Pinkus? 


It seems that a point long argued in the field of incompressible 
lubrication must now be renewed in the case of gas bearings. 
This point concerns the boundary conditions in solving the 
Reynolds equation. It is now fairly clear that except in rare 
applications, an antisymmetrical circumferential pressure dis- 
tribution is an incorrect, unreal representation of the actual 
workings of hydrodynamic bearings using liquids as a lubricant. 
This is basically true also of gas bearings. The quantitative 
error will be much smaller in gas than in liquid bearings, but this 
is due not so much to compressibility effects as to the inherent 
low load capacity of gas bearings. But even at low unit loadings 
the discrepancies still seem glaring. Following are some numbers 
for air bearings operating in atmospheric conditions, taken from 
two independent sources. 


Peak posi- 
Unit load- tive pressure, Peak negative 
ing, psi i pressure, psig 
2.1 -1.8 


3.: 


1 X 1 X 0.0012* 


As seen, the negative pressures are far from following the 
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bearings must suffer the same complications, if not worse ones, 
as do bearings lubricated with a liquid. 

This is not intended to be a complaint against the author for 
having used the boundary condition 


= + 2m) 


in his analysis for the writer is fully aware of the staggering 
difficulties in using the proper condition in an analytical solution 
of the differential equation. The ability to obtain analytical, 
albeit approximate, expressions is an achievement in itself. The 
point, however, is made that 


Pd) = p(d + 27) 


is not the correct boundary condition and that if digital com- 
puters are used in solving the compressible Reynolds equation 
an attempt be made to formulate a more realistic condition for 
the negative pressure loop. 


W. Wildmann® 


The author should be congratulated for making another 
valuable contribution to hydrodynamic gas bearing literature. 
Those who have had occasion to design hydrodynamic gas- 
lubricated bearings will certainly recognize the usefulness of 
Figs. 3and 4. To avoid large interpolations, a few more curves 
could have been included in these figures. 

While the author had no published experimental data on 
torques produced by misaligned gas-lubricated bearings, this 
writer has performed some experiments on the subject. The 
experimental setup consisted of a rotor supported on a gas- 


5’ Senior Research Engineer, Autonetics, a Division of North Ameri- 
ean Aviation, Inc., Downey. Calif. Assoc. Mem. ASME. 
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lubricated bearing subjected to gyroscopic torques. Measure- 
ments of attitude angle and eccentricity were made. Due to the 
presence of thrust pads in the experimental setup, the eccen- 
tricity measurements did not apply to the journal bearing alone 
and are therefore not reported. Measured attitude angles are 
shown in Fig. 6. It can be seen that agreement between the 
theory and the experimental data is very good except at low 
values of the dimensionless bearing parameter. This is because, 
at low bearing numbers, the eccentricity ratio was high and 
agreement with the author’s perturbation solution, valid only at 
low eccentricity ratios, cannot be expected. 

The good agreement between theory and experiment shows 
the validity of the paper; it also shows that agreement between 
theory and experiment is usually better for gas-lubricated bearings 
than for oil-lubricated bearings. This is surprising when one 
considers that complete solutions to Reynolds equations can 
usually be obtained for noncompressible lubrication while only 
approximate solutions exist for gas-lubricated journal bearings. 
The reason for this is the large number of assumptions concerning 
temperature, viscosity, and oil film breakdown that have to be 
made when noncompressible bearing theory is used. In gas- 
lubricated bearing theory, no such assumptions have to be made 
and the Reynolds equation with simple boundary conditions, 
as used by the author, gives a good description of the behavior 
of the lubricant film. 


Author's Closure 


Mr. Pinkus properly observes that peak positive (gage) pres- 
sures observed experimentally exceed the peak negative (gage) 
pressures in contrast to the symmetric pressure distribution pre- 
dicted by first-order perturbation theory. However, the pressure 
periodicity condition p(@) = p(@ + 27), cannot be blamed for this 
difference. Indeed, it is difficult to see how this condition could 
even be questioned. It states that the pressure at a point @ is the 
same as at a point @ + 2m” which is actually the same point. If 
this condition were incorrect, the pressure would be a multivalued 
function. That is, it could have several values simultaneously at 
the same point. 

Perhaps Mr. Pinkus is thinking about cavitation which plagues 
both the analysis and design of liquid-lubricated bearings. In 
liquid-lubricated bearings, the pressure on the negative loop may 
drop to the vapor pressure of the liquid but no further. At this 
point, part of the liquid changes state to a gas. The pressure 
gradient may be discontinuous under these conditions, but the 
periodicity requirement p(@) = p(@ + 27) for a journal bearing 
still must hold true. 

Of course cavitation cannot occur in a gas-lubricated journal 
bearing, and there is every reason to believe that under normal 
operating conditions the pressure distribution is not only single- 
valued, but also a well-behaved continuous function with con- 
tinuous derivatives. Possible exceptions are: 


(1) A grooved bearing with abrupt changes in film thickness 
which would cause abrupt changes or discontinuities in the 
derivative, 

(2) A gas lubricant operating near its critical point in which 
case liquefaction might occur on the positive pressure peak and 
cause a discontinuous derivative, or 

(3) A bearing operating with near-sonic gas velocities in which 
case shock waves might occur and cause an abrupt change in 
pressure. Even under these extreme circumstances, the pressure 
would still be single-valued. 

What is the correct explanation for the discrepancy between 
experimental pressure data and first-order perturbation pressure 
solutions? The answer is that the first-order perturbation solu- 
tion is only the first term in a series expansion of increasing order 
in eccentricity ratio. The series has the form 
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Oth-order Ist-order 2nd-order 


7 
Pp = po + €(F, sin @ + G, cos + + sin 26 + G, cos 26) 


3rd-order 


1 
+ F; sin + G; cos @ + sin + J; cos 3) 
4th-order 


1 


E, + Fysin 26 + G, cos 26 + Hy sin + 46) +... 


E,,, F., Ga, H,, and I, are actually functions of ¢, but for pur- 
poses of discussing the circumferential pressure distribution (at 
a constant value of [) they can be considered constants. 

Notice that the zeroth and first-order terms alone give a 
pressure which is sinusoidal with mean value po; the positive pres- 
sure peak is equal in magnitude to the negative pressure peak, as 
stated previously. If the second-order terms are included, the 
mean value of the pressure distribution becomes pp + €*Z; instead 
of po. In effect, the whole pressure distribution is elevated. The 
positive pressure peak is increased while the negative pressure 
peak is reduced in magnitude. The resulting pressure distribu- 
tion is asymmetric about p) in a manner similar to that observed 
experimentally. This then is the reason why the first-order 
perturbation solution is not necessarily a good approximation to 
the true pressure. There simply aren’t enough terms included 
from the series. 

Nevertheless, as stated in the paper, when the second-order 
terms are substituted into the load integrals and integrated over 
¢ from 0 to 27, the integrated effect is zero. The second-order 
terms do not affect the bearing load in spite of the fact that they 
significantly alter the pressure distribution (chiefly by raising 
in an algebraic sense both the positive pressure peak and the 
negative pressure peak). The e€? (F; sin @ + G, cos @) part of 
the third-order terms will contribute to the load, and these are the 
lowest-order terms neglected in the first-order result for the load. 
In other words, the first-order load expressions neglect terms of 
the order of €? with respect to €«. In the first-order solution for 
pressure itself, terms of the order of €* are neglected with respect 
to ¢ (if pressures are measured relative to pp» as is the case for gage 
pressure). It is to be expected that the first-order perturbation 
solution for gage pressure will be a rather poor approximation, 
particularly at high eccentricity ratios. 

In this regard it is well to point out that the data of Sternlicht 
and Elwell quoted by Mr. Pinkus in his discussion was taken at 
eccentricity ratios € between 0.45 and 0.80. Note thatate = 0.45, 
the peak positive pressure (2.2 psig) is not greatly different from the 
peak negative pressure (—1.8 psig). This fact lends credence 
to the author’s contention in the paper that he expects the first- 
order perturbation solutions to provide adequate engineering de- 
sign data for eccentricity ratios up to one half (€ < '/;). Un- 
fortunately, Cole and Kerr did not measure eccentricity. How- 
ever, from a comparison of their pressure data with the cata of 
Sternlicht and Elwell, it seems likely that the eccentricity ratio is 
greater than one half for all the points listed. 

The author hopes that the foregoing discussion will help put 
the first-order perturbation solution in proper perspective for the 
reader and point out some of its limitations as well as its useful- 
ness. The effect of the neglected higher-order terms is important, 
but it is difficult to discuss the subject adequately in the original 
presentation without disturbing the train of thought toward de- 
veloping a solution. The author wishes to thank Mr. Pinkus for 
providing this additional opportunity to clarify this point. 

The agreement between theory and Mr. Wildmann’s experi- 
mental data on torque attitude angle is most encouraging. Note 
that this agreement was achieved assuming k = 1 which cor- 
responds to isothermal lubrication conditions. 
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Friction Characteristics of Sliding Surfaces 
Undergoing Subsurface Plastic Flow 


It is well known that the load of an ordinary friction slider is supported by a large 


number of surface asperities having a collective area that is small compared with the 
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apparent area of contact. 
tically loaded. In many metalworking operations, such as wire drawing, extruding, 
rolling, and metal cutting, the bulk metal undergoes plastic deformation as sliding occurs. 
The influence of this subsurface flow upon the coefficient of sliding friction is discussed. 


The metal in bulk beneath such surface asperities is elas- 


A simple test procedure for studying the friction characteristics of sliding metal surfaces, 
one of which is being subjected to plastic flow in bulk, is described, and representative 


Woes one metal slides over another, the normal load 
W is generally supported by a relatively small real area of con- 
tact Az compared with the apparent area of contact A, Fig. 1. 
The characteristics of such lightly loaded sliding surfaces have 
been extensively studied for many years, see, for example, Ref- 
erence {1].!_ The tips of the asperities of the two surfaces are be- 
lieved to be plastically deformed sufficiently to provide a contact 
area Ap capable of supporting the load, and the friction force F 
is thought to result from the welding and plowing of asperities |1] 
or the interlocking of the slip-plane ends terminating in the 
asperities [2]. 

One of the classical laws of dry sliding surfaces is that of 
Amontons [3] which states that the coefficient of friction F is 
independent of the applied load W, where 


F 
(1) 


Sliders of the type shown in Fig. 1 are characterized by a 

sal area of contact that is very small compared with the apparent 
area of contact (Ap/A<1). For such sliders the zone of plas- 
tic deformation is confined to the asperities and there is no plastic 
deformation of the metal in the subsurface in bulk. It is for situa- 
tions of this sort that Amontons’ law holds. Essentially all of the 
research that has been done on friction in recent years has been 
associated with such lightly loaded sliders. 

There are many engineering problems where the sliding sur- 
faces are very heavily loaded and when an attempt is made to 
extrapolate the results for “‘lightly’’ loaded sliders to these cases 
major difficulties are encountered. Heavily loaded sliding surfaces 
are encountered in all of the processing operations that involve 
heavy plastic deformation, such as: Wire drawing, extrusion, hot 
and cold rolling of metals; cold rolling of gears, threads, and 
splines; impact and explosive forming; and the cutting and grind- 
ing of metals [4]. All of these operations are characterized by the 
sliding of surfaces, where plastic deformation is not confined to 
surface asperities, but extends into the subsurface layers of the 
bulk metal. The laws of dry friction have not been formulated 
for this region and the performance characteristics of lubricants 
are unknown. 


The relationship between the bulk plastic-flow friction regime 


1 Numbers in brackets designate References at end of paper. 

Contributed by the Lubrication Division of THe American So- 
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data are presented for both dry and lubricated sliding. 


and two adjacent bounding regions, that have been the subject of 
much research in the past, is shown diagrammatically in Fig. 2. 
Here the shear stress to which the apparent surface area A is 
subjected is shown plotted against the normal stress on the same 


area. Region I corresponds to relatively lightly loaded sliders 
where Amontons’ law (line OB of Fig. 2) holds. 

Region III, on the other hand, corresponds to the situation in 
ordinary materials testing where there is no free surface; i.e., 
where the real Ap and apparent A areas are identical. A charac- 
teristic result in materials testing is that the flow shear stress 7 
is independent of the normal stress on the shear plane o, and this 
result is shown graphically by line CD in Fig. 2. 

The region that has escaped study is the intermediate one 
(region II) which is characterized by the bulk plastic flow 
of region III, but the incomplete contact area of region I. Need 


A — 
Fig. 1 Lightly loaded sliding surfaces 
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Fig. 2 The three regions of solid friction 
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exists for a thorough examination of the laws of friction and of 
lubrication for region II and the application of these results to 
representative engineering processes. In this paper a test pro- 
cedure for performing friction experiments in region II is de- 
scribed and a few preliminary results presented. 


Apparatus 


In considering a possible friction test, where one of the sliding 
surfaces is subjected to bulk plastic flow, several possibilities such 
as simplified wire drawing, rolling, or cutting arrangements sug- 
gest themselves. However, all of these tests are relatively com- 
plex or employ specimens of complex geometry. A very simple 
and convenient possibility exists in a modified Brinell hardness 
test. If the specimen is mounted on a ball thrust bearing as 
shown in Fig. 3, then the torque required to rotate the specimen 
against the ball may be easily measured by means of a dynamome- 
ter. The necessary bulk plastic flow is provided by making an 
ordinary Brinell indentation at the beginning of the test. The 
plastic deformation that occurs beneath a Brinell ball is quite 
extensive as shown by the gridded surface of Fig. 4. This speci- 
men was produced by clamping two flat surfaces together, one of 
which was previously ruled with a grid, and then making a 
Brinell indentation at the interface of the two pieces. 


w 


Test Bal! 


Specimen 


RY Wooo 
Thrust 
Bearing 


Base of 
‘at Brinell 
Machine 


Fig. 3 Test apparatus 


In order to avoid the singularity that exists at the pole in the 
case of an ordinary Brinell test, it was found convenient to pro- 
vide a hole in the specimen at the point where the indentation 
was to be made. Experiments with balls and holes of different 
size revealed that neither of these quantities was critical, and a 
1/.-in-diameter ball used in conjunction with a 2-mm-diameter hole 
was found convenient. Ordinary bearing balls of AISI 51100 
steel operating on plates of AISI 1020 steel offer an inexpensive 
source of specimens. 

In the preliminary experiments reported here a spring balance 
was used to measure the torque necessary to rotate the specimen 
at a slow uniform speed relative to the stationary ball. The speed 
of rotation was without influence. Assuming the plate to be 
homogeneous and isotropic and the sphere to be so much harder 
than the plate that its deformation may be ignored, the mean 
friction force on the ball F was computed from the measured 
moment M (kg-mm) as follows 


F= (2) 
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The diameter of the hole d; after load was applied was smaller 
than the drilled size d;' due to plastic flow. Fig. 6 shows a typical 
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. 4 Zone of plastic deformation in Brinell test. Grid was ruled on 
flat surfaces which were clamped tightly together and Brinell impression 
made at interface between the two surfaces (photograph by C. T. Yang). 
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Fig. 5 Area of contact between ball and impression 
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where (Fig. 5) pl 
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and hence 

q 
d,? — d;? 
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1 
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Fig. 6 Section through representative ball impression and central hole 


impression produced by a '/;-in. ball loaded to 3000 kg. The 
diameter of the impression at the surface d, and the diameter of 
the collapse hole d; were measured in the usual manner with a 
Brinell microscope. 


Results in Air 


Results obtained using dry '/:-in-diameter balls and holes of 
different diameter, d,!, are given in Table 1. The values of mean 
shear stress r and mean normal stress @ were computed as 
follows: 


F 12M 
m(d,* = d;) 


T= 


(5) 


— 


W 


(dt = a2) 


Table 1 Tests performed in air using 1/2-in. diameter ball and 2-mm 
hole. Materials: Ball, AIS! 51,100 steel; plate, AIS! 1020 steel. 
1 
4 
™ Kg-mm Kg/mm" 
4.10 5.60 
5.95 
6.30 
6.60 


6.50 


500 
617 


15.2 
14.2 
15.9 
16.1 
17.5 
13.5 
15.2 
17.2 
17.9 
16.1 
18.3 
14.5 
16.9 
17.7 
18.2 
17.9 
18.4 
18.4 


4.50 

5.05 

5.40 

5.65 

6.05 

6.00 

3.81 

4.32 

4.75 

5.11 

5.48 

545 771 

5 .00 771 

Tests Under Light Load 

22.7 6.35 18.6 
90.7 6.50 6.35 64.8 
181.4 6.60 6.30 139. 


f = 0.26 
59.8 
59.6 


13.4 
13.2 


“No plastic deformation in bulk. 


The tests for increasing loads (1500 to 3000 kg) were performed 
upon the same hole, the load being increased to the next higher 
value after the torque had been measured. The duplicate test 
for the 3000-kg load was performed by applying the full load 
directly without stopping to measure the resistance to rotational 
motion. It is evident that while the two sets of values for the 
3000-kg loads are closely similar, the values of r and o@ for 
the directly applied load are slightly greater in all cases. The 
values of r and o from Table 1 are shown plotted in Fig. 7. 

The values given at the bottom of Table 1 were obtained using 
very light loads applied to a '/,-in. ball centered over a 0.250-in- 
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diameter hole. The first load was so light that there was no evi- 
dent plastic deformation at the surface of the hole at the com- 
pletion of the test. The other two tests produced amounts of 
plastic flow that were barely visible at the edge of the hole. 

The test for the lightest load, for which there was no subsurface 
plastic flow, yields the line marked ‘‘Amontons’ law.’’ It is seen 
that above a normal stress of about 30 kg/mm? the friction curve 
ceases to obey Amontons’ law, and the coefficient of friction con- 
tinues to decrease with increase in normal stress, ¢. Lines of con- 
stant coefficient of friction are shown in Fig. 7 for reference, and 
it is seen that whereas the coefficient of friction is 0.26 for a very 
light load, it drops to a value about half as great for the heavier 
loads. 


if 


Amontons Low — 


Fig.7 Variation of + with o for unlubricated tests using holes of different 
diameter 


A question frequently asked is why the coefficient of sliding 
friction on the surfaces of a wire-drawing die is so very low 
(0.10), while the coefficient of friction for the same metals 
when lightly loaded is substantially higher. The answer ap- 
pears to lie in the fact that sliding is accompanied by large 
amounts of subsurface flow in the case of a drawing operation and 
the presence of significant flow in bulk gives rise to a decrease in 
the coefficient of friction as illustrated in Fig. 7. 

From Fig. 7 it is evident that the size of hole is not critical. 
A hole of 2-mm diameter was found convenient and was adopted 
for subsequent experiments in which solid and liquid lubricants 
were applied. 


Results With Lubricants 


Table 2 gives the observed and computed data for a variety of 
lubricants and these results are shown plotted in Fig. 8. In 
making these tests the central hole was filled with the lubricant 
and a thick layer extended over the surface. Excess solid or 
liquid lubricant was displaced from the test area as load was ap- 
plied to the ball. The dry curve from Fig. 7 is shown dotted for 
reference. Those materials that yielded results below the dry 
curve may be considered to be good lubricants under the heavy 
flow conditions of this experiment. These materials include 
molybdenum disulfide and graphite which are two well-known 
solid lubricants together with ferric chloride. Ordinary mineral 
oil is seen to give results that are only slightly better than those 
for air. 

Carbon tetrachloride, manganese sulfide, and finely divided 
aluminum oxide were found to be poor lubricants. While it is not 
surprising to find abrasive aluminum oxide among the poor 
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Fig. 8 Variation of + with o for lubricated tests 
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Table 2 Tests performed with lubricants. Bell diameter, 1/2 in.; hole 
diameter, 2 mm. Materials: Ball, AIS! 51,100 steel; plate, AIS! 1020 


Lub. Kg mm Kg-mm Kg/mm Kg 
cl, 1000 4.06 1.60 439 26.4 92 
2000 4.80 1.40 770 27.0 120 
3000 5.55 1.22 1220 27.1 125 
MoS 1000 3.95 1.70 124 8.3 93 
2000 4.75 1.4) 354 12.9 124 
3000 5.50 1.15 479 11.2 132 
Mineral 1000 4.00 1.70 188 12.2 97 
2000 4.81 1.49 509 17.8 122 
3000 5.42 1.34 769 18.6 139 
Al,0, 1000 3.90 1.90 354 25 6 110 
2000 5.20 1.40 1700 46.5 180 
Graphite 1000 3.75 1.65 12% 9.8 113 
2000 4.60 1.30 276 11.0 131 
3000 5.50 1.10 895 20.6 132 
MnS 1000 3.80 1.85 216 16.9 116 
2000 4 82 1.40 664 22.9 120 
3000 5.60 1.45 1270 27.8 132 


3.80 1.70 124 9.4 Wl 
2000 4.65 1.41 310 11.9 130 
3000 5.50 1.13 510 11.7 132 


lubricants, it is revealing to find carbon tetrachloride and man- 
ganese sulfide yielding such poor results. In the metal-cutting 
literature it is frequently reported that manganese sulfide present 
in free-machining steels is beneficial by virtue of its ability to 
lubricate the chip-tool interface. This theory can be shown to be 
incorrect in terms of fundamental cutting-force studies and 
the results reported here support the view that manganese sul- 
fide is not a good solid lubricant. 

While carbon tetrachloride is in fact an effective cutting fluid 
at low cutting speeds, this is not due to its lubricating ability. 
The beneficial action of carbon tetrachloride is due to the chemi- 
cal action that occurs at the tip of a cutting tool and the in- 
fluence this action has upon the tendency for the chip to strain 
harden [5]. Although ferric chloride is formed in the cracks which 
propagate at the tip of the cutting tool the amount produced is 
exceedingly small. If substantial quantities of ferric chloride were 
formed at the tool tip this material should act as an effective 
lubricant. The fact that insufficient ferric chloride is produced 
at the tool tip to act as an effective lubricant when cutting in the 
presence of carbon tetrachloride can be demonstrated by re- 
versing the tool at the end of a slow-speed planing cut and meas- 
uring the friction on the return stroke. When this is done the 
friction is found to be higher than that obtained in air, clearly 
demonstrating that the coefficient of friction between tool and 
work is that corresponding to carbon tetrachloride and not to 
ferric chloride. 

When carbon tetrachloride is used as a wire-drawing lubricant 
the result is not confused by superposition of an influence on the 
fracture process at the tool tip and a lubricating function on 
the tool face. In the case of wire drawing, carbon tetrachloride 
can act only as a “‘lubricant’’ and in this case it is found to cause a 
significant increase in the coefficient of die-face friction, as Fig. 8 
would suggest. 


Concluding Remarks 
It is shown that in general three regimes of friction exist de- 
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pending upon the ratio of the real to the apparent area of contact. 
The simple modified Brinell test described enables data in the 
intermediate-friction regime to be obtained and the relation of 
the friction characteristics for this region to that of the conven- 
tional region of lightly loaded sliders to be compared. The simple 
test procedure presented should prove useful in studying the 
detailed action of the lubricants for use in drawing and other 
processes involving extensive subsurface plastic flow. It would 
further appear that useful wear results could be obtained from the 
simple system consisting of a Brinell ball rotated at constant 
speed in its impression. 
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Edmond E. Bisson? 


The results reported by the authors utilizing a simple Brinell 
system are very interesting, particularly those on lubricated tests. 
The authors’ results confirm the fact that graphite, MoS,, and 
chloride of iron are very effective lubricants at high contact 
stresses. These results are, however, not surprising in view of 
the extensive research results obtained on this type of lubricant 
at very high contact stresses using a sphere sliding on a flat. 

The results obtained with carbon tetrachloride are, however, 
quite surprising in view of the extensive mass of literature show- 
ing that CCl, is an effective cutting fluid at low cutting speeds. 
Since the generally accepted hypothesis for lubrication by CCl 
involves reaction at the surface to form chlorides, it would seem 
the simplified Brinell test should, because of its low speed, give 
ample time for reaction. The authors’ data show that ferric 
chloride, FeCls, is quite effective and considerable data are availa- 
ble in the literature to show a marked effectiveness of ferrous 
chloride, FeCl, references {1] and [2}.* Since there would appear 
to be ample time for reaction to form chlorides of iron, a much 
lower friction coefficient should be expected. 

I am not certain I understand the authors’ statements “If 
substantial quantities of FeCl, were formed at the tool tip, this 
material should act as an effective lubricant. The fact that 
insufficient FeCl, is produced at the tool tip to act as an effective 
lubricant when cutting in the presence of CCl can be demon- 
strated by reversing the tool at the end of a slow speed planning 
cut and measuring the friction on the return stroke. When this 
is done the friction is found to be higher than that obtained in 
air....’’ It would appear to me that friction coefficient meas- 
ured in such a manner (if I have deduced correctly) would al- 
ways be higher than that for FeCl, because the FeCl, in most 
cases would be worn off immediately. Maintenance of low 
friction requires continual regeneration of the surface film. 


? Assistant Chief, Fluid Systems Division, National Aeronautics 
and Space Administration, Lewis Research Center, Cleveland, Ohio. 
Mem. ASME. 
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David Tabor‘ 


This paper is valuable in focusing attention on the fact that 
the fundamental quantities involved in friction are the area of 
true contact A and the specific shear strength 7 of the interface. 
If the conditions are such that the area of true contact is propor- 
tional to the load and independent of the size of the bodies then 
(assuming T remains roughly constant) Amontons’ laws of friction 
will be obeyed. Butif the geometry or the nature of the deforma- 
tion process imposes a different law on the variation of A with 
load and area there will necessarily be a correspondingly different 
law of friction. This is indeed what the authors observe. 


4 Cambridge University, Cambridge, England. 
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Since in their measurements the friction is usually less than 
B = 0.2 it is probable that little junction-growth occurs during 
sliding and that there is little interaction between the tangential 
and normal stresses. Consequently, the true critical shear 
strength of the interface must be close to the values deduced 
from the frictional measurements. 

There are two points arising from this that I should like to 
raise: 


1 Do the authors consider that the true area of contact is 
indeed the same as the geometric area? 
dentations this is not necessarily so. 

2 There is, in general, a small increase in critical shear stress 
with increasing load. Do the authors think this is due to the 
effect of pressure on shear strength or simply to increased break- 
down of surface films?’ The evidence rather supports the latter 
view since with some of the lubricant films used (e.g., graphite 
and MnS) there is a very large increase in shear strength with 
load. One wonders if this was accompanied by any marked 
increase in surface damage. 


Even in hardness in- 
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a advantages of the addition of lead to steel have 
long been known, but only recently have they been measured 
quantitatively [3]}.!. Specifically, the presence of lead has been 
shown to reduce cutting temperatures, increase tool life, and per- 
mit increased cutting speeds. These benefits, however, have been 
detected and utilized in production in areas where one or more 
of the following conditions occurred—low cutting speed, soft 
workpiece material, or thin chip [15, 21]. Little or no informa- 
tion has been reported on leaded steel used as a general-purpose 
material for high speeds and for heat-treated conditions. Some 
evidence was found which indicates that the advantages of lead 
addition decrease for hardnesses above 30 Rockwell C [22], but 
no information was found for high cutting speeds [500 surface 
feet per minute (sfpm) and above]. Therefore it was decided to 
investigate the effects of lead addition for a wide range of speed 
and a wide range of strength levels. This extension of existing 
data will indicate the possible future applications of leaded 
steels, 


Design of Experiment 


Although it would be desirable to determine the interactions 
of stock chemical composition and lead addition, the amount of 

! Numbers in brackets designate References at end of paper. 
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Machining Characteristics of Leaded Steel 


The effects of lead addition in alloy steel upon the metal-cutting process were explored 
over a wide range of conditions. In particular, a range of cutting speeds (from 50 to 
800 fpm) and workpiece hardness (from 230 to 450 Bhn) were investigated on one work- 
piece material (4340) using principally a carbide (C-6) cutting tool. 
(two-dimensional) data was taken to describe the metal-cutting process, and tool-life 
data were obtained by running a typical production tool to failure at the various cutting 
conditions. Several mechanisms to explain experimental results, including lead acting 
as a lubricant, are discussed. 


testing required to obtain statistical significance for a wide range 
of cutting speed and material strength level would be beyond the 
scope of this paper. Therefore one material was used for the 
investigation. The material selected was 4340 and it was chosen 
for its favorable depth-hardening characteristics which would 
facilitate testing. A split heat was purchased to insure minimum 
chemical-composition variation between the leaded and plain 
material. Hardness levels of 235, 285, and 444 Bhn were used; 
the corresponding heat-treatments and microstructures are tabu- 
lated in the Appendix. All machining was done using C-6 carbide 
inserts. 

Two types of information were sought in this investigation. 
First, an insight into the mechanism by which lead affects the 
cutting process, and second, an extension of machining data to 
higher cutting speeds and increased hardness levels. Fulfillment 
of the first of these objectives utilized orthogonal cutting while the 
second relied on full tool-life testing 

Orthogonal Cutting. To determine the effects of lead addition 
upon metal cutting, it is necessary to describe the process by per- 
tinent measurements such as shear angle, machining coefficient 
of friction, and so forth. The fact that regular lathe turning is 
essentially three-dimensional, makes the task difficult, if not im- 
possible, at the present state of the art. One difficulty is im- 
mediately evident, i.e., that shear is taking place on two opposing 
planes, one with a width of the depth of cut and one with a width 
of the feed. In addition, the nose radius complicates the shearing 
process as do the rake angles and the side cutting-edge angle. 
Under certain conditions [17] an approximation can be made 
using a relatively simple three-dimensional model, but the error 
in results may still be large and the computations tedious. 

An orthogonal type of cutting was selected for describing the 


= Taylor-equation exponent of feed X = arithmetic mean of sample = exponent of feed in F, relation 
A = area machined in tool-life test, sq 7 = shear stress, psi b = exponent of depth in F, relation 
in. Y = shear strain 4, = tool temperature, deg C 
V = surface speed of material to be cut, R = range of sample R = aTg+y 
fpm @ = shear angle, deg 
N = number of test items = coefficient of friction, machining y = cost of cutting edge, cents 
n = Taylor-equation exponent of 7 4 g poe ge at revolution z = cost of machine, operator, and 
a = rake angle, side 
r = correlation coefficient 1 lif Pay lor-equation constant 
ae pe = tool life, min I = Knoop hardness number 
= normal stress, psi jensity of the steel stock. gr: 
” a ; , p = density of the steel stock, grams, C; = Knoop hardness constant 
D = diagonal of Knoop indentation, em! 
Filer unite Co = constant—F, relation 
. ' k = thermal conductivity, cal sec per m = “constant” in temperature rela- 
F, = feed force, lb em deg C ion 
F, = tangential force, lb ce = specific heat, cal per gram deg C T, = tool changing time, min 
r, = chip-length ratio \ = friction angle, tan~! wu ¢y’ = ordinate of plasticity plot, deg 
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metal-cutting process in this paper because the results would be 
more accurate and the computational work reduced. The results, 
however, may not describe production turning because test con- 
ditions and tool geometry are altered to approximate the orthog- 
onal cutting. In the present case it was felt that, although the 
orthogonal tests would not give a quantitative description of 
production turning, the tests would give valid qualitative in- 
formation. 

The most widely used method of obtaining orthogonal data is 
to use a tube of the desired material as the workpiece and a cut- 
ting twol such that its cutting edge is perpendicular to the axis of 
the tube and removes a chip of width equal to the tube-wall 
thicknéss. An alternate method of generating orthogonal data 
was to design a tool geometry that would accomplish two-dimen- 
sional cutting on solid bar stock. It was reasoned that if the side 
‘cutting-edge angle and the rake angles were zero, the nose radius 
very small, and the feed small with respect to the depth of cut, 
the result would approximate orthogonal cutting. These con- 
ditions were tested and it was found that the width of the chip 
was less than 0.008 in. larger than the depth of cut. This fact re- 
vealed that there was little side flow in the chip, and therefore, 
that the cutting mechanism was essentially two dimensional. 
The conditions used to obtain the orthogonal data were as 
follows: 


BR 
SRa 
ER 
SRe 
ECEA 
SCEA 
NR 


0 deg 
0 deg 
5 deg 
5 deg 
5 deg 
0 deg 
0.20 in. 


Negative land = 0.0005 in. at 45 deg to the top 
face of the insert 

Feed = 0.005 in. per revolution 

Depth of cut = 0.200 in. 

Chip breaker = 3/;, in. from the cutting edge 
(no contact with chip) 


+ @ See Fig. 1 for tool nomenclature. 


Because variation in tool grinding may increase the scatter in 
test data, special control was exercised in grinding the tools (for 
both orthogonal and tool-life testing). Each surface was ground 
with a 320-grit resinoid-bonded diamond wheel, so that the re- 
sulting surface finish averaged 8 microin. or less. The top face of 
all tools, however, was lapped by hand on a cast-iron block with 
400-grit boron-carbide abrasive. 

Preliminary tests [1] were run to determine where in the life of 
the orthogonal tool it is appropriate to take representative data. 
The results for carbide tool under the previous orthogonal condi- 
tions were as follows: (a) The machining coefficient of friction 
increases from an initial minimum value and approaches asymp- 
totically a higher value as the tool wears; (b) the length of time 
to this steady-state condition is a function of the cutting speed; 
and (c) a single tool will never ‘“‘wear in’’ (i.e., give a repeatable 
value of coefficient of friction) when it is used at intermittently 
varying speeds. Therefore it was concluded that the only con- 
sistent method of obtaining orthogonal carbide data for this 
paper was to use an initial value which was also the minimum 
value. After a tool had been used once, it was lapped and used 
again. Tools were not used more than these two times between 
grindings. No difference in results was detected between the 
freshly ground and the lapped tools. On the other hand, when a 
tool was lapped four or five times it gave varying results because 
the cumulative wear on the nose of the tool was not completely 
removed by lapping. 

Tool-Life Testing. The full tool-life tests were conducted at vary- 
ing cutting speed for each hardness, thereby furnishing informa- 
tion for a wide range of production applications. To facilitate 
usage of the data, the tool geometry chosen was considered typical 
for a rough turning operation with a throw-away insert-type tool, 
using a 0.011 feed and 0.100 depth of cut. This negative-rake 
tool holder is described in Fig. 1. To standardize the testing and 
reduce data variation the chip-breaker distance from the cutting 
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edge was */,. in.; and instead of honing, the cutting edge was 
chamfered with a negative land at 45 deg and with 0.001 in. 
width as measured from the top face of the insert. 

Complete tool failure was selected as the tool-life test criterion. 
A previous study [8] revealed that full tool-life testing is always 
correct while accelerated testing techniques often incorrectly 
describe the effects of machining variables. 


Test Equipment 


All testing was performed on a 25-hp, 2'/;-in. turret lathe 
modified for machinability testing. The lathe is driven by a 
variable speed d-c motor which provides an infinite speed varia- 
tion up to a spindle speed of 2000 rpm. A servosystem is in- 
corporated into the drive to maintain constant motor speed 
irrespective of load. The time constant of this system is 2 sec 
with a steady-state error of 1 per cent. This arrangement allows 
accurate control of the machining surface speed. Surface speed 
was measured with a friction-wheel hand tachometer to an ac- 
curacy of 2 per cent. Tool forces were measured by a three- 
dimensional strain-gage dynamometer. The tool holder was 
clamped into the dynamometer by set screws with positioning 
accomplished by the use of shim stock. A four-channel oscillo- 
graph recorder provided a continuous record of tool forces during 
machining. 


Test Results 


Because of the number and variety of data, the specific results 
of orthogonal cutting and tool-life testing are presented and in- 
terpreted separately in the first sections of test results. These 
data are followed by a discussion relating the results to the over- 
all effects of lead addition. 


TOOL SIGNATURE -5,-5, 
BR-BACK RAKE 
SRa-SIDE RAKE 
ER-END RELIEF 
SRe — SIDE RELIEF 
ECEA-END CUTTING EDGE 

SCEA-—SIDE CUTTING EDGE 
NR-NOSE RADIUS 


Fig. 1 Simplified insert-type too! holder 
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Orthogonal Cutting. presents the results of the 444-Bhn test where by necessity a 

Tool Forces. Figs. I-A, I-B, and I-C show forces on the arrow range of speed was investigated. At the higher speeds 
orthogonal cutting tool as a function of speed for 235, 285, (200 to 300 sfpm) there is a wide scatter in the results because of 
and 444 Bhn, respectively. (The half-shaded circles indi- rapid tool wear. This wear was evidenced mostly on the nose 
cate where leaded and plain-steel data coincide.) In each adius of the tool. It can be seen in Fig. II-C that the wear was 
case, above some critical speed, both leaded and unleaded steel 80 rapid at 300 sfpm that no data could be taken for the plain 
have the same feed and tangential forces. Below this critical steel. Throughout all the orthogonal testing, regardless of the 
speed the leaded steel forces are lower than the unleaded steel cutting speed, the leaded chips showed greater curl than did the 
forces. As the hardness of the material increases from 235 to Unleaded, Fig. 2. 


444 Bho, the critical speed decreases from 200 to 150 sfpm. Summarizing these results of coefficient of friction, the ‘‘bene- 
These force curves for the plain steel are typical of those previ- ficial” effects of lead addition, 1.8. reductions of 4 appear only 
° ously reported in machining literature. As the speed increases below some critical speed which is a function of hardness and 


from a very low value the forces on the tool increase to a maxi- Probably other machining conditions such as feed and tool 
mum, and as speed continues to increase the forces decrease geometry. 


asymptotically to some constant level. Shecr Angle. Figs. III-A, III-B, and IlI-C contain the results 
Coefficient of Friction. It may be seen from Fig. II-A that the of shear angle as a function of speed for varying hardness. 
° observable effect of lead addition at 235 Bhn is below 200 sfpm These curves are compatible with the curves of coefficient of fric- 


where the machining coefficient of friction u for leaded steel de- tion, although below the critical speed, the shear angle for 
creases while yu for the plain steel increases for decreasing speed the leaded steel 1s greater than the shear angle for the plain 
and then decreases. The results at 285 Bhn, Fig. II-B, were ‘steel. This is to be expected. A decrease in yw is generally 
similar to those above in the observed range of speed. Fig. II-C associated with an increase in rake angle. In contrast to the 

tool forces, which decrease asymptotically to a constant value 


with an increase in cutting speed, the shear angle increases asymp- 
totically to a constant value as speed is increased. Both co- 
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Fig. 2 Representative orthogonal chips (235 Bhn, 400 sfpm) 
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efficient of friction and shear-angle results reveal the same 
critical speed. 

Cutting Energies. An energy analysis (Figs. 3, 4, and 5) from 
the previous orthogonal cutting data displayed the fact that the 
friction energy remained nearly constant as speed varied while 
the shear energy tended to decrease as the speed increased. 
Above some critical speed, which again appears to be a function 
of hardness, there is no difference between any of the energies for 
leaded and plain steel. Below this critical value the leaded steel 
had lower shear energy and thus lower total energy than did the 
unleaded steel. 

Results of the Orthogonal Tests. These can be summarized as. fol- 
lows: 


1 Differences between leaded and plain steel were detected by 
orthogonal cutting at low speeds for all workpiece hardnesses 
tested. 


Transactions of the ASME 


LEADED 
f w 
° j 
« 
< 
3 
: 10 @ PLAIN 
] © BOTH LEADED 
& PLAIN 
+ 
o+ 
LEADED 
oe < © BOTH LEADED a 
a & PLAIN 
T 
6+ 
5+ 
% 


—— 


2 The tool forces, machining coefficient of friction, shear 


angle, and shear energy displayed these differences below the 
same critical speed for given material hardness. 

3 The effect of lead addition on the metal-cutting process is a 
function of speed and workpiece hardness. 

4 The leaded-steel chips always eurled more than their plain 
counterparts. 


Tool-Life Testing. Tool-life data as a function of speed for the 
three hardness levels tested appears in Figs. IV-A, IV-B, and 
IV-C. The lines through the data were obtained by a linear 
regression of the log of tool life (sq in.) on the log of surface 
speed (fpm). The mathematical equation for each set of data 
appears on the curve as well as standard error estimates which 
include 95 per cent of the test data. For clarity the lines for these 
standard error estimates do not appear on the curves, but their 
distance from the regression lines is given in inches (on these 
plots one log cycle is 5 in. in length). The values of the correla- 
tion coefficient for both the leaded and the plain steel are given 
in the figures. The form of the equation appearing in these 
figures is the widely known Taylor plot and the dimensions of the 
equations, to agree with convention, are V in fpm, and T in 
minutes. 

From the tool-life plots (Figs. IV-A and IV-B) it can be seen 


that the leaded-steel data has a greater slope. (This is shown 
also on the Taylor-plot equation where the exponent of T' is al- 
ways less for the leaded steel.) A “Student’s?’’ test was conducted 
to determine whether there was a significant difference between 
tool life for the leaded and nonleaded data. It was concluded 
from this test that there was no statistically significant difference 
between leaded and nonleaded tool life when data were taken 
over the full range of speed tested for the 235 and 285-Bhn ma- 
terial. It should be pointed out that this does not mean there is 
no difference between tool life, but only that none was proved 

in these tests the scatter in the data was large with respect to any 
difference in tool life of leaded versus nonleaded steel. An 
analysis of tool-life data at the upper speed did show a difference 
in tool life between the leaded and plain steel. At 800 sfpm tool 
life on the plain steel at 235 Bhn was greater to the 0.05 
confidence level and at 700 sfpm tool life on the plain steel at 285 
Bhn was greater to the 0.10 confidence level. On the other 
hand, for that part of the range of speeds tested which would be 
used in production (less than 600 fpm) no difference in tool life 
between the leaded and plain steel was proved. A similar analysis 
was not made for the 444-Bhn material because of the manner in 
which these data were obtained. At the high hardness level there 
was no point in the life of the tool which clearly indicated failure. 


Tabulation of Knoop diagonal squared for 235 Bhn 


sfpm Position X,x10-* R, 1073 
*100 Center || 425 47 16° 
100 Edge L 392 58 16° 
*100 Edge + 387 47 16° 
200 Edge ry 430 35 20° 
200 Center jj 474 90 20° 
310 Center | 450 93 24° 
310 Edge 4 422 59 24° 
L 
Knoop hardness number = RC, 


where 1 = eD; e = magnification .. C; = 1924 x 105 
The diagonal is given in Filar units. 


= arithmetic mean of sample 
R = range of sample 
N =} number of readings in sample 


The confidence level is the probability of being in error in assumin 


Confidence 
10-* Rt x 10-3 level N 
489 50 23° 64 90% 4 
413 36 28° 21 % 6 
433 43 23° 46 95% 5 
409 46 20° 21 90% 6 
473 36 20° 1 10% 6 
445 61 24° 5 20% 4 
415 84 24° 7 50% 6 


g the two samples are from the same population. The statistical test 


was a “Student ¢’”’ test using the range as an estimate of standard deviation [9]. 
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Tool failure resulted from increasing vibration due to excessive 
cratering close to the cutting edge of the tool. For convenience 
and because no other criteria proved suitable, 40 per cent increase 
in the feed force on the tool was selected as a point of failure. This 
definition of full tool life may partially explain the configuration 
of the leaded-steel data appearing in Fig. IV-C. However, chip 
formation of the leaded steel has a direct bearing on this curve. 
While the plain steel always exhibited a continuous chip, the 
leaded steel revealed a discontinuous chip below 200 sfpm, and a 
continuous chip at 200 sfpm. 


Effects of Lead Addition 


To augment the information directly obtainable from the 
orthogonal tests, several specialized investigations were performed 
seeking the causes of the results previously discussed. 

The first was a microscopic comparison of the leaded and 
plain steel chips. The chips from orthogonal cutting were 
mounted in plastic and ground until a representative longitudinal 
cross section was obtained. The specimens were then etched 
and photomicrographs taken (see Fig. 6). An examination of Fig. 
6 revealed the following effects of lead below the critical speed 
(200 sfpm): (a) Decreased chip thickness; (b) decreased rough- 
ness of the back of the chip; (c) decreased “‘wipe-down.”’ The 
first observation is directly related to the increased shear 
angle produced by lead addition. The decreased roughness of the 
back of the chip may be a function of the shear angle. Wipe- 
down, as used in this paper, pertains to the curvature of the 
chip grain-deformation lines on the tool side of the chip caused by 
secondary shear at the chip-tool interface. The plain steel chip 
has a nearly constant wipe-down with increased speed. How- 
ever, the leaded-steel chip has very little wipe-down at the lowest 
speed, but increased with increasing speed until above the criti- 
cal speed where there is no difference in wipe-down between the 
leaded and plain chips. 

To further explore the relationship between yu and wipe-down 
1113 (leaded and plain) steel at 190 Bhn was machined using 
orthogonal cutting and the chip was mounted similarly to the 
above. The reason for the choice of 1113 was the free-machining 
properties of this steel (decreased yu and chip thickness) which are 
similar to those achieved by leading 4340. Fig. 7 displays the 
fact that, although yu is decreased in the plain steel (compared 
with 4340, Fig. 6), the wipe-down is increased. The addition of 
lead to this steel produced a decrease in wipe-down similar to that 
produced in 4340 steel, and also a decrease in ys below the critical 
speed (500 sfpm). The higher critical speed of the 1113 leaded 
steel should be noted. The factors involved in the position of the 
critical point will be discussed later in this section. 

At this stage a more complete speed range was run on the 4340 
steel and these chip photomicrographs are shown in Fig. 8. Fig. 8 
completes the speed range of Fig. 6. Also included is a leaded- 
steel chip that was welded to the tool tip when the cut was in- 
terrupted. 

Included in Fig. 8 is a photomicrograph of a chip after Knoop- 
hardness readings were taken on it. After some experimentation, 
the placement of the indentations, as shown in Fig. 8, was chosen 
to reduce variation in hardness readings. Readings in the center 
of the chip were taken parallel to the flow lines, while those at the 
edge were taken at an angle perpendicular to the center readings. 

The square of the length (D x 10~*) of the longer diagonal of 
the indentations was recorded and a statistical tabulation is given. 

The chip hardness varies as the inverse of D* tabulated above, 
therefore the leaded steel produces a softer chip at speeds below 
the critical speed. From Fig. 6 it can be seen that 100 sfpm is the 
point of maximum difference between leaded and plain steel at 
these conditions. The leaded chip of the row marked by * was 
produced by an orthogonal tool of negative rake (a = —5 deg)— 
all others had an @ of 0 deg—to obtain leaded and plain chips sub- 
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jected to approximately the same shear angle. At the critical 
speed (200 sfpm) the leaded and plain chip hardness is similar in 
the center but different at the edges. As the speed further in- 
crease3 the hardness at the edge also tends to become similar. 

Data obtained from a BUE survey (orthogonal cutting) at 
various feeds and speeds is found in Fig. 9. The critical speed was 
not sensitive to changes in feed, Fig. 9(A). It can be seen from 
Fig. 9(B) that the size of the BUE, average height above top face 
of tool, does not appear to be the controlling factor of the critical 
point. The contact areas recorded in Fig. 9(C) further document 
the increased chip curvature (decreased contact width) observed 
qualitatively in previous tests. 

The following theories may offer insight into the mechanism 
whereby lead addition alters the metal cutting process: 

1 Real versus apparent area 

2 Lubrication 

3 Active shear planes 

1 The real-versus-apparent-area theory presented by Shaw 
[18] applies to the motion of the chip across the tool. It is pro- 
posed that this action is neither simple friction of an elastic 
body, where the real contact area is a small percentage of the ap- 
parent area, nor pure plastic flow, where sliding occurs within the 
material, and the apparent and real area are equal. This rela- 
tionship is shown in Fig. 10(A). In the elastic region (1) the 
shearing frictional stress 7 is proportional to the normal stress 
@ and in the plastic region (2) the shear stress is independent of 
the normal stress. Instead, it is proposed that motion at the tool- 
chip interface is a combination of elastic and plastic phenomena. 
An increase in shear stress so as to move from point A to point B 
would then decrease the machining coefficient of friction from 
to Mp. 

A decrease in wipe-down by the addition of lead is direct evi- 
dence that the shear stress has decreased and that the action is of 
a less plastic nature than without lead. However, the machining 
coefficient of friction decreased with the addition of lead. There- 
fore it must be concluded that lead changes the character of 
machining so that this change must be described with a different 
shear-normal-stress relationship. The lubrication theory suggests 
such a relationship. 

2 The lubrication theory was developed to explain the 
mechanism of machining leaded steel. The argument used is that 
the lead smears out between the chip and the tool to form a 
lubricating layer. If the lead in the steel does in fact act as a 
lubricant, the friction force would be reduced, causing the de- 
crease in 4 that was observed below the critical speed. The de- 
creased wipe-down of the leaded steel is in direct support of the 
theory. The reduction of the friction force F, and yu related to 
the preceding would produce the increase in shear angle and de- 
crease in chip thickness that were observed when machining 
leaded steel. On the shear-normal stress diagram, Fig. 10(B), 
this reduction in real coefficient of friction by lead addition can be 
portrayed by a curve lower than the first but asymptotic to the 
same shear stress for plastic flow. The addition of lead then would 
cause a movement from point B to point B’ with the result of 
lower shear stress and machining coefficient of friction. 

3 The increased-number-of-active-slip-planes theory suggested 
by Rebinder and used to explain the effects of CCl, by Shaw [19] 
could be present in the machining of leaded steel. If the finely 
dispersed lead particles were to act as stress concentrations the 
number of active slip planes could be increased and the shear 
strain on each plane reduced, thereby producing a softer more 
homogeneous chip. The decreased roughness of the back of the 
leaded-steel chip could be the direct result of this mechanism. 
It is difficult to assess the value of this theory because the princi- 
pal cause of the decreased hardness for leaded steel is the increase 
in the shear angle previously discussed in connection with 4340 
steel. 
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Fig. 7 Orthogonal chip photomicrographs 


If it is assumed that the strain-hardening properties of the 
leaded and plain steels will be equal, then the comparative chip 
hardness will be a function of y only. For orthogonal cutting the 
following relationship exists: 


cos @ 


sin @ cos (@ — a@) 


y= 
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Evaluation of y in this case presents a problem because the 
relation is not strictly valid in the region of BUE which is also the 
region of the influence of lead addition. Therefore this relation 
may only be used with reservation, and with the effect of BUE 
upon the real rake angle included. It can be seen that an in- 
crease in @ will produce a decrease in y. To determine the effect 
of BUE upon y, a plastic-flow analysis according to Lee and 
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Fig. 8 Chip photomicrographs 


Shaffer [7] was conducted. It was determined that all the shear 
(neglecting wipe-down) occurred in the region of the BUE and 
shear plane. From this it would be reasonable to obtain an aver- 
age @ and apply it to the shear-strain equation. The results of 
such an approximation using values obtained from Fig. 11 reveals 
that the shear strain of respective chips are 
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Fig. 10 Real versus apparent area theory 


DEPTH OF CUT =.200in. 
Fig.9 Orthogonal BUE data at constant feeds (4340 steel) 


Thus the difference in chip hardness can be explained by the lead 
affecting the tool forces and shear angle. 

Reviewing the foregoing discussion it appears that: 

1 The lead smears on the tool face and provides lubrication at 
the tool-chip interface. 

2 It was not determined whether or not the lead inclusions in- 
creased the number of active shear planes thereby producing a 
more homogeneous chip. 


The critical speed displayed by leaded steel is important when 
it is to be considered for some specific machining operation. 
Whether the economic cutting speed lies above or below the criti- 
cal speed is of prime importance in the final analysis since there 
is well-documented evidence [4, 12, 13, 14, 15, 21, 22] that the 
region characterized by decreased forces and yu is also one of in- 
creased tool life. Therefore it is advantageous to be able to de- 
termine the approximate location of the critical point as a func- 
tion of the machining variables. The previous discussions of the 
critical point determined it to be a strong function of cutting 
speed and material strength. It was also found to be possibly a 
function of feed. This information suggests that the tool tem- 
perature may control the critical point. The commonly used 
relation for tool temperature [20] is 


The cutting energy for orthogonal conditions is 
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Fig. 11 Lee and Shaffer plasticity solution for a = 0 deg 
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total cutting energy, in-lb per in. 

cutting speed, fpm 

feed, in. per rev 

thermal conductivity of workpiece, cal sec per cm deg C 
density of workpiece, gram per cu cm 

specific heat of workpiece, cal per gram deg C 

depth of cut, in. 


Or . 
Combining in terms of — since m is a “‘constant”’ 


m 
m "VS La Wipe 


(See Nomenclature for units.) 


0 
From the experimental data the critical point [*) ] was 
crit 
found to have a value of 8 x 104, 


4340 Steel 0.20 depth 


9.9 X 10* 
11.8 X 10¢ 
6.8 X 104 


Boston [2] gives a relation for the tangential force (F,) which 
holds approximately true at and above the critical speed, 


F, = Coftd’ 


and permits a solution of the temperature parameter in terms of 
feed. 


6, 
where Cy = function of strength level. 
Substituting average values for the conditions of this report 


z=08 b= 1.0 
Therefore 


Oy 
— = 0.069C,f"-* — 
m Vz 

This relationship is approximately true if the machining process 
does not deviate greatly from orthogonal cutting. 


Solving for Verit, it can be seen that the critical speed varies 
with the square of strength and less than proportionally to a 
change in feed. 

Having already seen that applications with low economic 
cutting speeds tend to be conducive to the use of leaded steel, we 
wish also to determine the effects of feed and hardness. If the 
Taylor tool-life equation (V7T"f* = £) holds in the region of in- 
terest, the economic tool life is given by 


Vers = 1.1 X wo 


r=a(+-1) where = ong 


zx = cost of machine and operator, cents per min 
T, = time to change tool, min 
y = cost of tool per cutting edge, cents 


Solving for the economic cutting speed ( V mia) 
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where a representative value of a is 0.8. Comparing this rela- 
tionship with the expression for V.,i:, it can be concluded that 
an increase in feed decreases Vin more than V,,- and therefore 
a high-feed application is more likely to benefit from leading the 
steel. Because a high ® decreases the economic cutting speed, a 
low machine and overhead cost or a high tool cost tend to in- 
crease the probability that leaded steel is advantageous. It is 
difficult to make a general conclusion as to the effect of strength 
level on use of leaded steel because the constants of the tool-life 
equation vary with strength. 

The above discussion may be a helpful method of determining 
when leaded steel can be applied to increase tool life. Because 
the cost of leaded steel is greater, it is not always true that 
higher tool life is sufficient to substantiate its use. An additional 
analysis such as given by Cook [5] is required for this purpose. 


Conclusions 


1 The effects of lead on the cutting mechanism were revealed 
only below a critical speed, which was found to decrease for in- 
creased workpiece hardness. 

2 The effects of lead include decreased machin ng coefficient of 
friction, increased shear angle, decreased shear energy, and lower 
cutting forces below the critical speed. 

3 An increase in cutting speed tended to decrease tool life 
more when machining leaded than when machining plain steel. 

4 The lead additive acting as a lubricant on the chip-tool 
interface was found to be the mechanism whereby the above 
effects were produced below the critical speed. 

5 It is proposed that the critical speed is directly related to 
the tool temperature. 
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APPENDIX 
Physical Properties 


The physical properties of a steel have pronounced effects upon 
its machining properties, therefore, this appendix was compiled 
primarily to facilitate correlation of the data and results pre- 
sented in this report with that of other investigators. 

The steel used was a split heat of 4340 hot-rolled 6-in. bar stock 
with a chemical composition as follows: 


Element 
Carbon 
Manganese 
Phosphorus 
Sulfur 
Silicon 
Nickel 
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Chromium 
Molybdenum 
Lead content of lead portion 


The heat-treatments used are described in the following table: 


Table of heat-treatment 
Time, 
Treatment hr 


235 Bhn 
Anneal 5 


285 Bhn 
Austenitizing 
Temper 


444 Bhn 
Austenitizing 


Temp, 


deg F Water quench Oil quench 


1500 Furnace cooled 


1500 
1100 


1500 to 500 F 


1100 F to room 
temp 


1500 1500 to 500 F 


700 700 F to room 


temp 


Temper 


The longitudinal microstructure of each of these hardness levels 
is given in Fig. 12. The photomicrographs were taken from the 
midradius of the 6-in., hot-rolled, bar stock at a distance of 6 in. 
from the end of the heat-treated bar. 

The 235-Bhn material consisted primarily of ferrite (white 
areas) and pearlite (dark areas), while the 285 and 444-Bhn 
specimens consisted of tempered martensite (dark areas), tem- 
pered bainite (gray areas), and ferrite (white areas). The band- 
ing consists of alternate concentrations of ferrite and was pro- 
duced by the hot-rolling process. These bands will have no ad- 
verse effect upon the machining process since the cutting was 
done across them. 
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A General Flow Graph Technique for the 
Solution of Multiloop Sampled Systems 


This paper presents a flow graph approach for analyzing multiloop sampled-data 

systems. Two techniques for finding the sampled output are examined. These are, 

first, construction of a ‘‘sampled” signal flow graph from the original system; second, 

a general gain formula which can be directly applied to the original system. The 

theorems necessary to establish the formula are rigorously proved. 7 
The first technique allows two possible modes of solution. The sampled output can 

be found directly from the sampled flow graph by the use of Mason's formula, or in case 

of a more complicated multiloop system, the problem of enumerating nontouching 


feedback loops can be simplified by the use of topological matrices. Techniques de- 


Introduction 


NUMBER OF AUTHORS have used the signal flow 
graph [1]'in analyzing a multiloop sampled-data system (2, 3, 4]. 
Recently, Lendaris and Jury [5] have found a general gain ex- 
pression which is an extension of Mason’s result for a linear con- 
tinuous system. However, it turns out that the formula of 
Lendaris and Jury is not valid in general. This paper describes a 
general signal flow graph technique for analyzing multiloop 
sampled-data systems. Two techniques for finding the sampled 
output are developed. These are, first, the construction of a 
sampled signal flow graph from the original system. The sampled 
output may then be found from the sampled flow graph by the 
direct application of any of the known techniques for the solution 
of multiloop continuous systems. In particular, the authors 
suggest the use of Mason’s general gain formula [1] or “topologi- 
cal” matrices [6]. The second technique developed in this paper 
allows the computation of the sampled output directly from the 
original system through the use of a general gain formula. 
Techniques for finding the unsampled output are also discussed. 

To formulate the problem let us examine the system whose block 
diagram is shown in Fig. 1. In this system a plant is controlled by 
means of a bypass digital controller [7, 8], the whole structure 
forming a single sampled-loop system. 


Fig. 1 Single sampled-loop system 


1 Numbers in brackets designate References at end of paper. 

Cuntributed by the Instruments and Regulators Division of Tue 
AmerIcAN SocteTty OF MECHANICAL ENGINEERS and presented at 
the ASME-AIEE-IRE-ISA National Automatic Control Conference, 
Dallas, Texas, November 4—6, 1959. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, August 
10, 1959. Paper No. 59—NAC-1. 
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veloped in the paper are also applied to the solution of multirate systems. 


A flow graph representation of the system of Fig. 1 is shown in 
Fig. 2 where for simplicity D(z) is denoted by D* and G(s) and 
H(s) by G and H, respectively. 2;, 2, and 2; are transforms of 
node variables appearing at nodes 1, 2, and 3. Nodes are placed 
at the intersection of branches and before and after a sampler in 
each sampled branch.? (If there are two or more samplers in a 
branch, they can always be reduced to one sampler followed by a 
sampled transfer function.) 


The system of Fig. 2 is characterized by the equations 
2,* (1) 
a, = D*GHz, + Gx, 


Finding the output of the system in terms of the, input and the 
system parameters is equivalent to solving equations (1) for 
2; or z;*. We shall use the sampled signal flow graph to simplify 


the solution. 
Derivation of the Sampled Graph 
In general, using the principle of superposition, the equations 


describing a multisampled-loop system may be written in matrix 
form as 


AX = BX* +R (2) 


where A and B are square matrices whose order is equal to the 
number of variables, X is the column vector of variables, and X* 
is obtained from X by replacing each variable by its sampled 
form. If Q is any matrix, the symbol Q* will be used to denote 
the matrix obtained from Q Ly replacing each element q,; of Q 

2 We shall use distinct symbols for the input and output variables 


ofasampler. If 2; is the input and z; the output, then z; = 2;* 
This procedure simplifies proofs which follow. 


| 
Fig. 2 Flow graph for the system in Fig. 1 
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* 
Xe 


by its sampled form q,,*. lf we restrict ourselves to single-input 
systems (multiple input systems may be easily handled by super- 
position) and adopt the convention that the input is connected 
to the node corresponding to the variable z;, then 


r 
0 
0 
0 


where r is the input. For the system of Fig. 1 


1 0 1 00 0 
A= 0 1 0 B=|1 00 
-G -D*GH 1 000 


If we solve equation (2) for X, we obtain 
X = A-'BX* + (3) 
Taking the sampled form of equation (3) we have 
X* = (A~'B)*X* + (A-'R)* (4) 


Equation (4) represents a set of simultaneous linear equations 
involving only the sampled variables as unknowns. The signal 
flow graph corresponding to equation (4) will be called the 
sampled graph. To construct the sampled graph we need to know 
the elements of the matrices A~'B and A~'R. The physical inter- 
pretation of A~! is described in the next section. For the system 
of Fig. 1 we have 


[1 -—D*GH -1 
A-! = ——/0 G 
1+ 0 
|G D*GH 1 
0 0 
= —— 1 G 00 
(5) 
| D*GH 0 0 
1 
AR = i+G 0 
Gr 


Therefore, equation (4) can be written as 


The sampled flow graph for this system is shown in Fig. 3. The 
output c* computed from the sampled flow graph is 
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If A = G/(1 + G), then equation (7) becomes upon simplification 
(Ar)* + (AH)*D*r* 
1 + (AH)*D* 


(8) 


which agrees with the result for the same problem obtained on 
page 188 of reference [8}. 


Efficient Construction of the Sampled Graph 


In this section we shall demonstrate that it is always possible 
to obtain the sampled graph without the necessity of matrix in- 
version. The required elements of A~'B and A~'R may be found 
by conventional signal-flow graph technique, as the following 
argument shows. 

Let A = [a,,;]. The matrix A then characterizes the equation 


+ Ait, +... + Aint, = 


+ +... + AenTn = 


. 


+ +... + = Yo 


We may interpret these equations as describing the original con- 
trol system with all samplers open-circuited. The forcing func- 
tion y, may be interpreted as a source connected to node k of the 
original graph. If a,,~' is the element in row é and column j of the 
matrix A~!, then 


‘ (10) 
|A| V; ] all other y's = 0 


where 
A, = the cofactor of a,, 
|A| = the determinant of A 


Thus a,,;~' is the response at node i of the original system due to 
a unit source applied at node j, with all samplers open circuited 
Let 


= Gy (11) 


where G,, is the transfer function from node j to node i of the 
original system with all samplers open-circuited. 
The following properties of the matrix B are easily verified. 


1 The elements of B are either 1 or 0. 

2 The element b,, of B will be 1 if and only if the variable z, 
is the input to a sampler and z, is the output of the same sampler. 

3 B contains at most one nonzero element per column. 
(Two samplers with the same input may always be replaced by 
a single sampler.) 

4 If b,; = 1, then column i of A~ appears as column j of 
(AB). 


Gey 


GH 
-0"\i+e 


Fig. 3 Sampled flow graph for the system in Fig. 1 
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R= 
‘ 
Be 
z,* r (9) 
X=/| 2 x* = R=/0 
is 

GH \* Gr 
(; + (j + 3) 
i+ 
GH 

* Gr 
i+G 


5 If z, is not the input to a sampler, column j of (A~'B) con- 
sists entirely of zeros. 


From these properties and equation (11) we may make the fol- 
lowing statement: 


If the variable z, is the input to a sampler and the variable z;, is 
the output of the same sampler, then column j of A~'B appears 


as follows: 
AW. | (12) 


If x, is not the input to a sampler, then column j of A~'B consists 
entirely of zeros. 

From equation (11), the matrix A~'R has the form 
Gur) 
Gur 


A-R =| (13) 


L 
Equations (12) and (13) allow the sampled graph to be con- 
structed efficiently from the original graph. 


Illustrative Example | 


To illustrate the technique of efficient sampled flow graph con- 
struction let us consider a multisampled-loop system shown in 
Fig. 4. The flow graph representation of this system is shown 
in Fig. 5. 

We observe that z, and z¢ are the only variables at the input to 
samplers. Therefore we can write directly from Fig. 5 that 


0 00 Gud 
Gn 000 0 Gp 
9 00 0 (4) 
Gu 0000 Ge 
Gs 000 0 Gs 
0 00 0 Gel 


Evaluating the elements of A~'B we have 


Fig. 5 Flow graph for the system in Fig. 4 
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0 0 0 
1 0000 0 
& © 000 0 (15) 
0000 
09 0000 1 
| GG: 0 0 0 0 


For the system under consideration, G,; = 0 for i # 1 and there- 
fore 


0 

(16) 
0 


0 


In view of these relations, equation (4) can be represented by 
a sampled flow graph shown in Fig. 6. The output c*, which can 
now be computed by the direct application of Mason’s formula 
{1} or by the use of topological matrices [6], is 


r*(G,G.)* 
+ + (G2H2)* 


Iustrative Example ( Multirate System) 


In this example we shall illustrate the application of the 
sampled flow graph to the solution of systems containing samplers 
with periods 7'/p,, T/p2, T/ps . . . where p,; is a positive integer. 
It has been shown elsewhere [9] that each sampler operating with 
a period T/p,; can be replaced, using the “switch decomposition 
technique” by a system containing advance and delay elements 
together with a number of samplers all operating with period 7’. 
Thus any multirate system can be replaced by an equivalent 
single rate system possessing a more complicated structure. It 
has been shown [9, 10] that the transform technique of solution 
of such systems leads to simultaneous algebraic equations one of 
whose unknowns is the transform of the output. The use of the 
sampled flow graph avoids the problem of solution of simultane- 
ous equations. 

Without any loss of generality let us consider the simple multi- 
rate system shown in Fig.7. The equivalent single-rate system is 


(17) 


-(6,62)" (5,62) 


Fig.6 Sampled flow graph for the system in Fig. 4 
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Fig. 7 A simple multirate system 
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shown in Fig. 8. Fig. 9 shows the flow graph representation of 
this system. 

Since 2,, 2, and 2, are the only variables at the input to sam- 
plers, therefore 


Ge Gs 07 
Gu Ge Gx 


and similarly 


Substituting the expressions for A~'B and AR into equation 
(4) we obtain 


z,* = —H*z,* + r* ) 

z,* = z,° 

= G*z,* + (0.G)*z,* (21) 
= —(H0,)*z,* + (0,r)* 

= z,* 

= z;* 


The sampled flow graph representing equations (21) is shown in 
Fig. 10. The expression for the sampled output, c*, which can be 
computed using Mason’s formula or topological matrices, is 


_ + 
1 + G*H* + (0,G)*(0,H)* 


(22) 


A General Gain Formula 


It is possible to bypass the sampled graph entirely and compute 
the sampled form of the output directly from the original system. 
Rearranging equation (4) for convenience, 


— (AB)*)X* = (A-'R)* (23) 


where J is a unit matrix whose order is equal to that of A and B. 

If the output c is the node variable z,, then using equation (13), 
we have 

(Gur)* Ain (Gir)* Aon 
= 

A A 
where 
A = determinant of J — (A-'B)* 


Aim = cofactor of the element in row i and column n of J — 
(A-"B)* 


(G,,7)*A,, 
4) 
(2 


The determinant A and the various cofactors A;, may be com- 
puted by applying Mason’s formula [1] to the sampled graph. 
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¢ 


Fig. 8 Single-rate equivalent of the system in Fig. 7 


Fig. 10 Sampled flow graph for the system in Fig. 8 


In order to avoid the construction of the sampled graph, we must 
know the mechanism by which the feedback loops and forward 
paths occurring in Mason’s formula are formed. 

(a) Feedback Loops. We shall first establish the following re- 
sult: 

If a branch originates from node z,* of the sampled graph, then 
z, must be the input to a sampler in the graph representing the 
original system. 

To justify this statement, suppose that a branch 6,, originates 
from node j and terminates on node k of the sampled graph. From 
(4) and (13) the equation at node k of the sampled graph is 


z,° = + +... 4+ +...4+ 
+ (Gur)* (25) 


where ¢,,* is the element in row k and column j of (A~'B)* and 
is also the weight of the branch b,. From the previously dis- 
cussed properties of the matrix B, it follows that if z,; is not the 
input to a sampler, then column j of (A~'B) consists entirely of 
zeros, and hence t;,* = 0. Thus if z; is not the input to a sampler, 
no branch b,, can exist in the sampled graph. 

We shall now adopt the convention of numbering the samplers 
so that z, is the input to sampler j. For example, if a system con- 
tains three samplers, whose inputs are 2z;, z;, and 2, then we shall 
call the samplers S;, S;, and S¢, respectively. We shall denote by 
Gs the gain from the output node of sampler S, to node k, with 
all samplers open-circuited. We shall denote by Gs,s; the gain 
from the output node of S; to the input node of S,, and G,s¢ the 
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Pal 
Gu 0 Ge Gs 0 0 (18) 
Gu 0 Gu Gu 0 0 re) 0. 7 at 
: 2 
Gy 0 Ge 0 0 
Gu 0 Gu Gu 0 0 & 

Computing all G,,’s from the graph we obtain T - 

00 -H O 0 0 
10 0 0 00 Xe 
AB = Go 0 0G 0 0 (19) an 
0 0 —-HO, 0 00 
0 1 00 
( T 0,6 
0 1 ) 0 0 1 2 
Ks 
r 
AR =|? (20) 
Our 
0 Fig. 9 Flow graph for the system in Fig. 8 : 
0 
Or 

| 
j 

. 


gain from node i to the input node of S,, with all samplers open- 
circuited. 


We shall now show the following: 
If nodes 1, 2, .. ., m, 1 are connected in that order by a feedback 
loop of the sampled graph, the weight of the feedback loop is 


To establish this result, we note that if nodes 1, 2, . . . m of the 
sampled graph are connected by a feedback loop, each node must 
be the input to a sampler. According to our convention, the 
samplers associated with nodes 1, 2, ..., m are S,, S:,..., S,,. 
Suppose that a branch b,, originates from node j and terminates 
on node k, where z, is the input to sampler S, and k is the input 
to sampler S,. From the properties of the matrix B, and from 
(12); column j of (A~'B) appears as follows: 


---Gga--- 
= | ---Gsa--- (26) 


---Gs~n--- 


The weight of b,, is the element in row k and column j of (A~'B)*. 
Hence the weight of b, = Gsj* = Gs;s,*." The feedback loop 
under consideration is formed by branches bes, Dm—1.mDmi- 
Hence the weight of the loop is 


Gs,s,°Gs,s,* eee 


as has been asserted. 
From this discussion, the weight of a feedback loop of the 
sampled graph must be of the following form: 


Gs,s;*, etc. 


According to Mason’s formula [1], the determinant of a system 
of linear equations described by a signal flow graph is given by 


where P, is the sum of the weights of all feedback loops, and 
=P; (i > 1) is the sum of the products of the weights of the non- 
touching feedback loops, taken 7 at a time. 

Applying Mason’s formula to the sampled graph, we obtain 
the following expression for A in equation (24): 


A=1- Gs,s;* + 1/2 


+ 1/3 + 


i,j, k 


+ — (28) 


where 2P, is the sum of the products of the weights of the distinct 
nontouching feedback loops in brackets, taken k at a time. The 
number m is the maximum number of distinct nontouching loops. 
Two loops of the sampled graph will be nontouching if the expres- 
sions for the weights of the loops, as appearing in (28), have no 
indexes in common. For example, the terms Gs,s,*@s,s,* and 
Gs,s,*@s,s,"Gs,s,* correspond to nontouching loops. 

The appearance of the factors 1/2, 1/3, etc., in (28) may be 
explained by noting that the terms 


Gs,s,*Gs,s,*Gs,s,*, Gs,s,*Gs,s,*Gs,s,* 


+ In view of (25), and the fact that t;,.* = 0 if z; is not the input 
to a sampler and tj,* = Gs,«* if z; is the input to sampler S;, the 
sampled graph may be constructed directly from the original system 
without writing any equations. The procedure previously described 
for the construction of the sampled graph seems to be more sys- 
tematic and therefore preferable in more complicated systems. 
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all represent the same feedback loop. All three terms appear, 
however, in 


(b) Forward Paths. Let us now describe the nature of forward 
paths of the sampled graph. Suppose we wish to find all for- 
ward paths from node i to the output node (node n). Fora 
forward path to exist at all node i must be an input to a sampler, 
which by convention we call S,. Then the following terms may 
appear as weights of forward paths: 


Gsin*, Gsis,°Gsn*, 
(29) 


etc. 
Note that the sampler S, will never appear in (28), since the 
last branch of a forward path always terminates on node n. The 
only forward path from node n to node n has weight 1. 

According to Mason’s formula [1], the cofactor of the element 
in row 7 and column n of the determinant of a system of linear 
equations described by a signal flow graph is given by 


(30) 


where F, is the weight of a forward path from node ¢ to node n, 
and A, is the expression (27) evaluated for the graph obtained by 
removing all branches touching any of the nodes of F,. 

Again applying Mason’s formula to the sampled graph, we ob- 
tain the following expression for A,,, in (24). 


A, = (31) 


where F,; is one of the terms in (29) and A, is the expression (28) 
evaluated with all terms involving S, or samplers appearing in 
F , deleted. 

The terms (G,,r)* in equation (24) need be computed only if 
node ¢ is the input to a sampler. Accordingly, for each sampler 


S,, we shall have to compute a term (Cito in (24). 


lilustrative Example tll 


As an example, consider the system shown in Fig. 11. Follow- 
ing Lendaris and Jury [5], define 


G, Ll GG, 


1+ GOGH, “1+ GGGH, 


Fig. 11 Illustrative example Ill 
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By (28) 
ia 
1 — [Gs,s," + Gs,s,* + Gs,s,* + Gs,5,°Gs,s,* + Gs,s,°Gs,s,° 
+ Gs,s,°Gs,s," + + 
+ Gs,s,*°Gs,s,* + Gs,s,°Gs,s,* + Gs,s,°Gs,s," 
+ + 


+ — (32) 


By inspection of the original system we obtain 
Gs,s," -D,*L,°, —D,*L,°, 

Gs,5, 
= —D,*L,*, 


Gs,s," 


*=| —D,*L,", 


Gs,s, Gs,s," 


* =| —D,*L,*, 
Gs,s,* 


Gs,s,* = —D,*l,"*, Gs,s; 


Substituting (33) into (32) we obtain 
4 =1- [—D,*L,* — D,*l,* — L,* + 
+ D,*L,*L,* + — — 
— + 


+ D,*L,*L,* + D,*L,*L,* — 3D,*D,*L,*L,*L,° 
+ 
Simplifying, 
A = 1+ D,*L,* + D,*L,* + L* (34) 
The numerator of c* is, by (24), 
(Gis,r)*An + (Gisyr)*An + (Gis,7)* An (35) 
We have Gis, = Ih, Gis, = In, Gis, = Ls. 
By (31), (29), and (32), 
An = Gsy*(1 — Gs,s,*) + 
An = Gsy*(1 — Gs,s,*) + Gs,s,°Gs* (36) 


An = 1 — [Gs,s,* + Gs,s,* + Gs,s,°G@s,s,*] 
+ Gs,s,*Gs,s," 
From the original system 
—D,*L,*, 
Hence (35) becomes 
+ D,*L,*) + 
+ + D,*L,*) + 
+ — (—D,*L,* — + D,*L,*D,*L,*) 
+ D,*L,*D,*1,*| = — (rl,)*D,*L,* 
+ (rLs)*(1 + + 


Gsp* = —D,*L,* 


and finally 


output is obtained by matrix multiplication using (3). It ap- 
pears that, if a general gain formula for the unsampled output 
existed, it would be cumbersome and difficult to apply. 


IMustrative Example 


We shall find the unsampled output of the system of Fig. 1. 
From the sampled graph of Fig. 3 we obtain 


(38) 


From (3) and (5) we obtain 


Zs D*GH 0 0 Gr 
or 
D*GH Gr 
—— 39 
Zi c i+@ i+@G ( ) 


Substituting from (38) we obtain 


a) 


D*GH 


= 


(40) 


3) G 
Conclusions 


This paper has presented two systematic techniques which 
may be applied without restriction to any multiloop single or 
multirate sampled-data system. For complicated systems, the 
techniques are appreciably more efficient than the standard alge- 
braic approach. In the opinion of the authors, it is easier to con- 
struct the sampled flow graph than to apply the general gain 
formula directly. It is found that in most cases the flow graph 
techniques are not “‘minimal’’; expressions occur which contain 
identical terms with positive and negative signs. Such terms 
have no net contribution to the system output. This suggests the 
possibility of improving the efficiency of the techniques presented 
in this paper. 
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.. The unsampled 


—(rL,)*D,*L,* — + (rLs)*(1 + D,*L,* + 
(rI,)*D,*Ls* + + (rly)*L,* 
= 
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DISCUSSION 


Benjamin C. Kuo‘ 


The authors have presented a very unique and useful signal 
flow graph approach for analyzing multiloop sampled-data sys- 
tems. Once the system node variables are properly designated, 
the matrices A~!B and A~!R can be obtained by inspection from 
the original flow graph. The sampled flow graph is constructed 
by using A~'B, A~!R, and Equation (4). The sampled output c* 


‘ Coordinated Science Laboratory & Electrical Engineering Depart- 
ment, University of Illinois, Urbana, IIl. 


may then be computed irom the sampled flow graph by the use of 
Mason’s gain formula. 

The authors pointed out that in order to compute the un- 
sampled output c, it is in general necessary to compute all 
the sampled node variables z,*, z2*,..... z,*. The unsampled 
variables, including the unsampled output c* are then obtained 
by matrix multiplication using Equation (3). However, since 
Equation (3) indicates that the unsampled variables are related 
to the sampled variables, it suggests that a composite signal 
flow graph may be constructed which is composed of the original 
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Fig. 12 Composite signal flow graph of sampled-data system in Fig. 1 


Fig. 13 Composite signal flow graph of sampled-data system in Fig. “ 
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flow graph and the sampled flow graph. For instance, the com- 
pos'te signal flow graph for the system in Fig. 1 is shown in Fig. 
12. The composite graph consists of the sampled graph of Fig. 3 
and the original graph of Fig. 2. The sampling switch of the 
original flow graph is omitted, and the sampling operation is de- 
scribed by drawing a branch of unity gain from node z,* to node 
2 representing the relation rz. = z,*. From this composite flow 
graph, all the variables x3, 2;*, and can be computed 
by means of Mason’s formula. Thus from Fig. 12 the unsam- 
pled output is 


* 
D* 
(; + ‘ 


i+G 


which agrees with Equation (40). 
Similarly, the composite signal flow graph for the system in Fig. 
4 is constructed in Fig. 13. The unsampled output is obtained 
directly by applying Mason’s formula. 
G,G,{1 + (G.H-)*\r* — r*( 


(42) 


1 + (GiG,)* + (G.H:)* 


It is apparent that all the other unsampled variables can also 
be obtained by the same procedure. 


George G. Lendaris®.° 


The authors have successfully formulated an accurate proce- 
dure for determining the sampled output of a general multi- 
sampled-loop system whose samplers are in synchronism (multiple 
or submultiple rates included). The discusser was quite pleased 
to note that another group of investigators also considered this an 
important problem. 

The authors suggested in the conclusion of their paper that the 
efficiency of the techniques could perhaps be improved. It is the 
aim of this discussion to present an efficient technique for carry- 
ing out this procedure. 

As the authors mentioned in their introduction, the discusser’s 
work and resulting paper in this area’ led to a formula that turned 
out not to be completely general. However, in pursuing this 
work further, the discusser developed an accurate general tech- 
nique for determining the sampled and continuous output for 
these systems. This latter work was included in the paper’ as a 
closure to supersede the method given in the main body of the 
paper. 

In general, it is found that the topological method is more ef- 
ficient than the algebraic formula method. This is true because 
most engineering problems are such that the system determinant 
contains some zero terms. The algebraic formula method by its 
nature does not take advantage of this fact before the calculations 
are made, whereas the topological method does. The latter does 
because for every zero term in the determinant, there is a branch 
missing in the flow graph; hence this term does not enter into the 
calculation procedure. For the case where all the terms of the de- 
terminant are nonzero, the algebraic and topological methods are 
equally as efficient (or inefficient, as the case may be). 

The general approach to this problem (as the authors have also 
indicated) is to represent the given system block diagram by 

’ Department of Electrical Engineering, University of California, 
Berkeley, Calif. 

* This work was supported by the United States Air Force through 
the Air Force Office of Scientific Research of the Air Research and 
Development Command, under Contract No. AF 18(600)-1521. 
Reproduction in whole or in part is permitted for any purpose of the 
United States Government. 

7G. G. Lendaris and E. I. Jury, “Input-Output Relationships for 


Multi-Sampled-Loop Systems,’ AJEE Applications and Industry, 
January, 1960. 
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some sampled-signal flow graph, and then use standard tech- 
niques to proceed from the flow graph to the desired result. As- 
suming an efficient method for obtaining the desired result from 
the signal flow graph is available,* the problem reduces to that of 
finding an efficient signal flow graph representation of the given 
system. 

In the discusser’s opinion, for a system with n samplers, the 
best representation for the system should be a sampled signal flow 
graph with n + 2 nodes; one for each sampler, one for the input, 
and one for the output. The discusser derived such a form in the 
previously mentioned work,’ and will present the results here. 

The following is the proposed procedure for transforming the 
block diagram representation of the system into its efficient 
sampled signal flow graph representation. 


(1) Eliminate all redundant samplers in the original block 
diagram. This step is not necessary for correct results, but it 
simplifies the resulting flow graph. 

(2) Number all samplers (in any fashion desired). 

(3) On a separate sheet of paper draw a node for each sampler 
indicated in step (2) and label these S,*, S,*, .. ., S,*. 

(4) Open all samplers in the block diagram. 

(5) Start from sampler (1) and traverse all possible continu- 
ous paths from the output of sampler (1) to the input of all 
samplers. Where these exist, draw in a branch on the signal flow 
graph from node S,* to the respective node S,* with the direction 
and the appropriate sampled transfer function indicated. Then 
start at sampler (2) and repeat procedure until all samplers are 
covered. 

(6) Now add two other nodes, one labeled with a 1 (the input), 
and one labeled with a C (the output). From the input node draw 
a branch to each of the nodes which has a corresponding con- 
tinuous path in the block diagram, and label with the appropriate 
gain (RG to the output, and the RG;,,* to the other nodes). 
Then draw a branch from each of the nodes S,* which has a 
corresponding continuous path in the block diagram to the output 
node C and label appropriately (G,,0’). 

Then to obtain the output of the system. 

(7) Apply Mason’s or Coates’ method to the nodes marked C 
and 1 of this sampled-signal-flow graph. 


If the sampled form of the output is desired, one may use the 
standard technique of starring the expression for C obtained in (7). 

To illustrate its application, this procedure will be applied to 
some of the examples given in the paper. 

Consider successively the systems of Figs. 1, 4, and 8 of the 
paper, and in addition the system of Fig. 14. By applying rules 
1-6 to these systems, the corresponding sampled signal flow 
graphs can be drawn directly as shown in Figs. 15, 16, 17, and 18, 
respectively. By applying Mason’s theorem to these flow graphs, 
one may write by inspection the expressions for the output: 


Fig. 15: 
(45) 
1+G 1+G 


(43) 


§ Mason’s method referred to in the paper will serve this purpose. 
It should be pointed out that more recent work has been done in this 
area by C. L. Coates, “Flow Graph Solutions of Linear Algebraic 
Equations,” TRE PGCT, June, 1959.8 The method he presents is 
guaranteed to give a minimal term expression for the gain, and for this 
reason is in many cases superior to Mason's method. However, for 
the types of flow graphs the control engineer is likely to encounter, 
the advantage is very little. 

* For the person who does not wish to pore over the very elegant 
topological exposition given by Coates, C. A. Desoer has written a 
very simplified version of the same work with the same results: 
“The Optimum Formula for the Gain of a Flow-Graph, or, a Simple 
Derivation of Coates’ Formula,” to appear in [RE Proceedings, 
March, 1960. 
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Fig. 17 Sampled-signal-fiow graph of Fig. 8 
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Fig. 15 Sampled-signal-flow graph of Fig. 1 


Fig. 19 Reduced form of Fig. 11 
Fig. 16 Sampled-signal-flow graph of Fig. 4 


Fig. 16: The sampled form of Equations (43), (44), and (45) reduce, re- 
spectively, to Equations (7), (17), and (22). 

R*(G.:G(1 + G.H2*) — G,G.*G:H:) Now consider the system of Fig. 11 in the paper. From the 

ih 1+ GG.* + GH* sampled signal flow graph of this system, it is noted that this is 

somewhat of a special system in that none of its determinant terms 

is zero. Therefore this method has no particular advantage over 


Fig. 17: 


R*G(1 + HO,*0.G*) — R*G*HO,*0.G + RO,*0.G(1 + G*H*) — RO,*0,G*H*G 
1 + G*H* + HO,*0.G* 


(45) 
Fig. 18: 


R*G,*G,*G, 
G,H,* + H2*G.H,* + + Gi*G:"G,H,* + 


(46) 
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Fig. 20 Sampled signal flow graph of Fig. 19 
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Fig. 21 Effect of (analytically) moving a sampler through a D* box 


that of using the authors’ Equation (32), and the continuous ver- 
sion of Equations (36). However, if the analytical trick” of mov- 
ing samplers S, and S, through their respective D* boxes is used, 
there results the one sampler system shown in Fig. 19. Its 
sampled signal flow graph is given in Fig. 20. 

The expression for the output can now be written directly as 


+(RL,*D,* + + 


ry 


(47) 


The sampled form of Equation (47) reduces to Equation (37). 
This last example points out that for better efficiency, Rule 1 
should be applied to its fullest extent. 


Julius T. Tow"? 


The authors have written a very interesting paper on the de- 
termination of the output transforms of multiloop sampled sys- 
tems by use of the signal flow graph techniques. The methods 
described in this paper are certainly more general and appear to 
be more useful than the other methods in the literature [2, 3, 5]. 
The general gain formula, Equation (4), derived in this paper 
appears quite complicated, and thus is not too convenient to ap- 
ply. In this respect, the sampled-graph technique seems to be a 
better method. However, it should be pointed out that to derive 
the sampled-graph (or pulsed-signal flow graph as called by this 
discusser) the use of matrix manipulation does not lead to simpli- 
fication, but it may even complicate the problem which may be 
readily solved otherwise. 

The rules of reduction for conventional signal flow graphs can- 

% One might argue that physically this step has no meaning, since 
by its usual definition D* must have its input in sampled form. How- 
ever, one may note that analytically the input-output relationships 
of Fig. 21(a and b) are both the same: 

C = R*D* 
11 Associate Professor, Purdue University, Lafayette, Ind 
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not be applied directly to sampled-data systems, because of the 
presence of samplers in the flow graph. One way of making the 
conventional rules directly applicable is to reduce the conven- 
tional flow graph to one for pulsed signals only. Thus, this 
method of determination of the output transforms of multiloop 
sampled-data systems generally involves two steps: 


1 Derivation of a pulsed-signal flow graph from the correspond- 
ing conventional signal flow graph. 

2 Determination of the over-all pulse-transfer function from 
the pulsed-signal flow graph by use of the rules of flow graph re- 
duction. 


The second step is quite straightforward. Clearly, the first step 
can be easily accomplished when the set of equations 


n 


is obtained. In the Equation (48), z,* is the pulsed signal from 
the sampler following node j which will flow .o node k; a, is the 
transmission from the sampler following node j to node k; R, is 
the signal applied at node n which will flow to node k; 8, is the 
transmission from node n to node k; and z, is the resulting signal 
at node k. By applying the principle of superposition, Equation 
(48) follows immediately by inspection of the conventional signal 
flow graph describing the sampled-data system. Evidently, no 
matrix is needed, although one can use it if he likes to. In set- 
ting up Equation (48) the use of flow graph reduction rules is 
sometimes of great help, as demonstrated in an example which 
follows. Taking the starred transforms (or Z transforms) of 
both sides of Equation (48) yields 


= a,;*z;* + > (49) 
n 


The derivation of the pulsed-signal flow graph from Equation 
(49) is a simple matter. This is the method which is currently 
used in the digital control course offered at School of Electrical 
Engineering, Purdue University. 


Fig. 22 
As an illustration, the sampled-data system of Figs. 1 and 2 of 
this paper is considered. To facilitate the writing of the equation 


for 2, the signal flow graph of Fig. 2 is reduced to the form of Fig. 
22 by well-known rules. Inspection of Fig. 22 yields 


In like manner, the signal flow graph of Fig. 2 is reduced to the 
form of Fig. 23 in order to facilitate the derivation of the equation 


(51) 


The equation for z, is obvious. 


2,* (52) 
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Fig. 23 


Taking the starred transforms, these equations are reduced to 
Equation (6) of the paper from which the pulsed-signal flow graph 
can readily be drawn. Clearly, following this discussion, a con- 
trol engineer can derive the pulsed-signal flow graph and then the 
output transform of a multiloop sampled-data system with ease, 
even though he is not armed with a sound knowledge of matrix 
manipulation and topology. 


Authors’ Closure 


The authors wish to thank the discussers for their very interest- 
ing and pertinent comments. As Dr. Kuo and Mr. Lendaris 
point out, the unsampled output can indeed be found without the 
necessity of matrix multiplication. The application of the ‘‘com- 
posite” signal flow graph is completely systematic and leads 
efficiently to a solution. Dr. Kuo’s procedure is essentially 
equivalent to that given by Mr. Lendaris, as may be seen by 
comparing Figs. 13 and 16. In Fig. 13 we may delete nodes 


370 / suNE 1960 


and x5*, and also all branches incident on these nodes. 
Since the nodes in question do not correspond either to the system 
input, the system output, or to the input of any sampler, their 
removal has no effect on the computation. The original flow 
graph may be replaced by the single node labeled c, with branches 
connecting z,* and z¢*, the weights of the branches being G,G; 
and —G2H2, respectively. If this modification is carried out, 
Fig. 13 becomes identical with Fig. 16. However, it may be 
desirable to retain the extra nodes in the flow graph; for example, 
if we wish to consider some other points in the system as the 
output, we are spared the necessity of drawing another graph. 

Professor Tuo suggests that the use of matrix algebra in the 
derivation of the sampled flow graph may result in complications. 
The authors emphasize that they have used matrix methods only 
for the purpose of justifying the final results. In practice, the 
elements of the sampled flow graph are found not by inversion of 
matrices, but by applying Mason’s formula to compute various 
transfer functions of the original system, with all samplers open- 
circuited. It is clear that if the equations describing the system 
can be manipulated into the form (48), the sampled flow graph 
may be drawn by inspection. However, the authors do uot favor 
the use of flow graph reduction rules to accomplish the manipula- 
tion. The application of the rules requires judgment and in- 
tuition; in the analysis of complicated systems it is quite possible 
to go astray. The authors believe that Mason’s formula is more 
straightforward and efficient. 

In response to Professor Tuo’s closing remarks, the authors 
suggest that a control engineer should be armed with a sound 
knowledge of matrix theory and elementary network topology. 
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Control System Analysis and Design 
Via the “Second Method’ of Lyapunov 


Continuous-Time Systems 


The ‘second method’’ of Lyapunov is the most general approach currently in the theory 
of stability of dynamic systems. After a rigorous exposition of the fundamental con- 
cepts of this theory, applications are made to (a) stability of linear stationary, linear 
nonstationary, and nonlinear systems; (b)estimation of transient behavior; (c) control- 


system optimization; (d) design of relay servos. 


The discussion is essentially self- 


contained, with emphasis on the thorough development of the principal ideas and 


mathematical tools. 


Only systems governed by differential equations are treated here. 


Systems governed by difference equations are the subject of a companion paper. 


1—Introduction 


0... OF THE MOST important events in the theory of 
stability of dynamic systems was the publication in 1892 of 
Lyapunov’s famous memoire in a Russian journal. Translated 
into French in 1907 and reprinted in America in 1947 [1],* it is 
still relatively unknown and unappreciated in the West. By con- 
trast, the so-called ‘‘second method”’ of Lyapunov has achieved 
virtual pre-eminence in the Soviet Union as the principal mathe- 
matical tool in tackling linear and nonlinear stability problems of 
the most varied type, particularly in the theory of control sys- 
tems. 

The purpose of this paper is to give a self-contained exposition 
of the second method of Lyapunov, with particular attention to 
applications in the theory of control. We shall treat here only 
continuous-time dynamic systems; i.e., those governed by ordi- 
nary differential equations. In a companion paper, an analogous 
treatment is given of discrete-time dynamic systems. 

The objective of the so-called ‘‘second method” of Lyapunov is 
this: Jo answer questions of stability of differential equations, 
utilizing the given form of the equations but without explicit knowl- 
edge of the solutions. The name “‘second method” is an un- 
fortunate but well-entrenched historical misnomer‘; actually the 
“second method”’ is more accurately described as a point of view, 
a philosophy of approach, rather than a systematic method. At 
present, much depends on the ingenuity of the user. In the future, 
we can hope that systematic procedures will be made possible by 
machine computation. Yet the concept of a Lyapunov function 
(see below) appears to be very basic; the whole theory of control 
systems can be studied in a unified way using Lyapunov functions, 
as we shall show in what follows. Being a relatively new point 
of view, the “second method’”’ has much promise for the further 


! This research was supported in part by the U.S. Air Force Office 
of Scientific Research under Contract AF 49(638)-382. 

? 7212 Bellona Ave. 

3? Numbers in brackets designate References at end of paper. 

‘The so-called “first method" of Lyapunov deals with stability 
questions via an explicit representation of the solutions of a differen- 
tial equation. 

Contributed by the Instruments and Regulators Division of Tus 
American Society oF MecuanicaL Enarneers and presented at 
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development of control theory, particularly as regards nonlinear 
systems. 

The principal idea of the second method is contained in the fol- 
lowing physical reasoning: If the rate of change dE(x)/dt of the 
energy E(x) of an isolated physical system is negative for every 
possible state x, except for a single equilibrium state x,, then the 
energy will continually decrease until it finally assumes tls minimum 
value E(x,). In other words, a dissipative system perturbed from 
its equilibrium state will always return to it; this is the intuitive 
concept of stability. 

In order to develop this idea into a precise mathematical tool, 
the foregoing reasoning must be made independent of the physical 
concept of energy. As a rule there is no natural way of defining 
energy when the equations of motion are given in a purely mathe- 
matical form. The mathematical counterpart of the foregoing 
reasoning is the following conjectured theorem: A dynamic sys- 
tem is stable (in the sense that it returns to equilibrium after any 
perturbation ) if and only if there exists a ‘‘ Lyapunov function,” i.e., 
some scalar function V(x) of the state with the properties: (a) V(x) 
> 0, Vin) < Owhenx # x, and (b) V(x) = Vix) = O when x = 
x,. As will be seen (Theorem 1 and 2), this conjecture is correct 
under slight additional mathematical restrictions. Occasionally, 
but not necessarily always, it will suffice to take V = Z. 

One of the attractions of the ‘‘second method”’ is its appeal to 
geometric intuition. This is illustrated by the following two ex- 
amples. 

Example 1. Harmonic Oscillator. Normalizing the angular fre- 
quency to 1, a harmonic oscillator is described by the linear dif- 
ferential equation 


A+m=0 (° =d/dt) 


Letting 7; = 22, this equation can be written in the equivalent 
form 


(1) 


The state of the system is represented by the vector x = (21, 22). 
The solutions (trajectories) of (1) in the state-plane (phase-plane) 
are concentric circles about the origin, Fig. 1. The energy of the 
system is given by 


E(x) = E(x, 2%) = 2:7 + 22? = V(x) (2) 


The derivative V of V along any solution of (1), i.e., along any 
trajectory in Fig. 1, is given by 
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Fig. 1 Phase-plane trajectories of harmonic oscillator 


= 


x’ [grad V(x)] 


= 
* at 


= 22,2, + 


which is identically zero by (1). Hence V(x) = 0; the energy 
remains constant—(1) is a conservative system. 

Example 2. Asymptotically Stable Nonlinear System. Consider 
now a slight modification of system (1): 


= Ze — + 227) 
(3) 


= — + 2”) 


where a is a positive constant. 

We again define V by (2), but now of course this quantity is 
not necessarily the energy of system (3). Calculating as before, 
the derivative V of V along a solution of (3) is 


Vix) = ae = —2a(z,* + 2:7) (4) 


= — 2aV%x) 


which is negative unless z;} = z. = 0. This shows that V is con- 
stantly decreasing along any solution of (3), and hence V is a 
Lyapunov function. 

With reference to Fig. 2, it follows also that the trajectories of 
(3) cross the boundary of every region V(x) < const from the out- 
side toward the inside. From this we are led to the following 
geometric interpretation of a Lyapunov function: 

Let V(x) be a measure of the ‘‘distance’’ of the state x from the 
origin in the state space, i.e., V(x) > O when x # O and V(O) = 0. 
Suppose the distance between the origin and the instantaneous state 
x(t) is continually decreasing as t —> @, i.e., V(x(t)) <0. Then 
x(t) + 0. 

In Examples 1-2, V(x) is actually the square of the Euclidean 
distance. In general V(x) = const turns out to be (because of 
Vix) < O)a family of concentric closed surfaces surrounding the 
origin such that the surface V(x) = c, lies inside the surface V(x) 
= cz whenever c; < cz. Thus we can say that y is “farther” away 
from the origin than x if V(y) > V(x). 

Unlike the energy of a system, the Lyapunov function V(x) is 
not unique; this is precisely the reason why the ‘‘second method”’ 
of Lyapunov is a more powerful tool than conventional energy 
considerations. A system whose energy E decreases on the 
average, but not necessarily at each instant, is stable but Z is not 
a Lyapunov function. 
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Fig. 2 Phase-plane trajectories of harmonic oscillator with nonlinear 
damping 


2—Outline of Contents 


Sections 4-6 together with the Appendix provide the necessary 
mathematical machinery. Sections 7-10 contain the applica- 
tions. 

Description of Dynamic Systems (Section 4). To apply the 
“second method,” it is necessary to describe dynamic systems 
from the “‘state”’ point of view. Since this may be unfamiliar to 
many control engineers, an outline of the most important con- 
cepts and definitions is provided. 

Definitions of Stability (Section 5). In treating general types of 
systems (nonlinear, nonstationary) it is necessary to make careful 
distinctions between various intuitive aspects of stability. The 
necessary definitions are stated precisely and in accordance with 
current mathematical usage. 

The Main Theorems (Section 6). Theorem 1 gives the condi- 
tions on a Lyapunov function whose existence guarantees the 
strongest type of stability. In the corollaries of Theorem 1, these 
conditions are stated in various weakened forms suitable for cer- 
tain applications. Theorem 3 summarizes the current state of 
knowledge concerning the stability of linear, nonstationary sys- 
tems; Corollaries 3.1-2 provide a purely algebraic procedure for 
solving many problems in linear stationary systems, which is 
simpler than the customary methods based on the Laplace trans- 
form and is well suited for machine computation. All proofs of 
this section may be omitted at first reading without loss of con- 
tinuity. 

Applications to Stability Theory (Section 7). An easy conse- 
quence of Corollary 3.1 in the linear, stationary case is a rigorous 
elementary derivation of the Routh-Hurwitz inequalities (Ex- 
ample 6). The paper and pencil derivation is very messy, but the 
method lends itself well to machine computation. Example 7 
gives the construction of a Lyapunov function for the canonic 
form of feedback systems. Example 8 is an application of the 
“second method” to an electric circuit consisting of a time-vary- 
ing resistor, inductor, and capacitor. Example 9 gives a rigorous 
justification of the usual linearization procedure. Example 10 is 
a proof of the stability of passive nonlinear networks. Examples 
11 and 13 give conditions, similar to the Routh-Hurwitz inequali- 
ties, for the stability of some nonlinear systems; Theorem 4 is 
probably the best available result along these lines which does not 
depend on the order of the system. 

Estimation of Transient Behavior (Section 8). Since a Lyapunov 
function can be regarded as defining distance from the origin in 
the state space, its derivative V can be used to give a quantitative 
estimate of the speed with which the origin is approached. This 
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leads to an equivalent time constant even for nonlinear systems 
(see Example 14). Similarly, the Lyapunov function can be used 
to estimate the effect of random perturbations and sensitivity 
to parameter variations, as in Example 15. 

Relations With System Optimization (Section 9). If a per- 
formance index is defined as the error criterion integrated with 
respect to time, then the performance index is actually a Lya- 
punov function—provided the error criterion is not identically 
zero along any trajectory of the system. This observation (The- 
orem 5) removes a serious shortcoming of the conventional Wiener 
theory of quadratic optimization since the latter does not auto- 
matically guarantee that the quadratic optimal system is stable. 
Example 16 shows how a conventional quadratic optimization 
problem can be solved by means of Corollary 3.1; Example 17 is 
a rigorous treatment of the same problem using elementary con- 
cepts from game theory. 

Design of a Relay Servo (Section 10). While exact optimization 
is usually difficult or impractical, the ‘second method’’ can sup- 
ply some new ideas for approximate optimization. In Example 
18 we present a very simple design algorithm of this type for a 
stable relay servo (again based on Corollary 3.1) which requires 
only a linear compensating element. The analytical findings 
are supported by an analog-computer study. 


3— Guide to Western Literature 


There are very few expositions of the subject in Western 
languages; most of those available are written from the abstract 
mathematical point of view. The following comments are in- 
tended to facilitate study of the subject from the engineering 
point of view. 

The original work of Lyapunov [1] (in French) is difficult to 
read because of obsolescent mathematical terminology. An 
authoritative survey of mathematical results until 1956 is given 
by Massera [2]. The recent monograph of Hahn [3] (in Ger- 
man) covers existing theoretical results and includes a few ap- 
plications. There is now an English translation [4] of the well- 
known book of Malkin. This book, allegedly addressed to engi- 
neers, is primarily a detailed and rigorous mathematical treat- 
ment of classical problems. There is a section (Chapter VI, par. 3) 
on the basic theorems of Lyapunov in the well-known book of 
Lefschetz [5]; see also his paper [6] on recent generalizations. 
The problem of control-system design has been treated at length 
by Lur’e [7] (German and English translation) and Letov [8]. 

Hahn [3] and the survey article of Antosiewicz [9] give many 
references to Soviet literature. Over the past 10 years, a great 
deal has been published about the ‘‘second method” in the jour- 
nals Avtomatika i Telemekhanika and Prikladnaya Matematika i 
Mekhanika (now both available in English translation). 


4—Description of Continuous-Time Dynamic Systems 


From now on extensive use will be made of vector notation. 
Small boldface Roman or Greek letters will denote vectors. 
Capital boldface Roman or Greek letters will denote matrices. 
The unit matrix is |, the zero vector or matrix 0. The trans- 
pose of a vector or matrix is denoted by the prime; the scalar 
product of x, y is x’y, and the quadratic form associated with a 
square matrix A is x’Ax. The norm is denoted by ||x||; in 
specific calculations this is to be taken as the Euclidean norm 
(x’x)'. The eigenvalues of a matrix F are \,( F). 

In this paper we shall study systems governed by the vector 
differential equation: 

dx/dt = F(x, u(t), t), <éi< + (5) 
This is equivalent to the set of n scalar differential equations 
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dz,/dt = . Ur(t), « 


The vector x is the state of the system (5), its components z, are 
the state variables. The vector u(t) is the control function (or fore- 
ing function or input) of the system (5), its components u,(t) are 
the control variables. The system is specified by the vector-valued 
function f. The integer n is the order of the system. 

If u(t) = 0 for all t, we say that (5) is free (unforced): 


dx/dt = f(x, t) (5-F) 


It is always assumed that the function f in (5) is sufficiently 
smooth so that the equation has a unique solution starting at any 
initial state x» at any time bo. 

To make this idea precise, we first fix u(t) in (5), so that we 
might as well consider (5-F). We assume that there exists a 
unique vector function (t; x0, ts), differentiable in t, such that, for 
any fized xo, to 


(i) xo, = xo 

(ii) dep(t; xo, = H(t; xo, to), t) 
in some interval lt = to| S a(t). The function @ is called a solu- 
tion of the system (5-F) or (5). 

In older texts on differential equations, particularly in the en- 
gineering literature, it is customary to write x(t) for the solution. 
This ‘‘informal’’ usage will be employed here only rarely; reasons 
of clarity demand in most cases (as in Massera [2] and Hahn [3]) 
the use of the explicit notation just introduced. Thus $(t; 
xo, f) means: a solution of (5), with fixed u(t), going through 
state x» at time &, observed at timet. We also call the transition 
function since it shows how x(t) is transformed into x(t). “With 
this interpretation, f in (5) may be called the infinitesimal transition 
function since the differential equation specifies the infinitesimal 
state transition x — x + dx corresponding to the infinitesi- 
mal change in time t > ¢ + dt. 

Now if the differential equation (5) is to represent a physical 
phenomenon, one would always assume also that (t; xo, to) is 
continuous in all of its arguments (of course, continuity in ¢ is 
already demanded by (ii) above!). A sufficient condition for the 
validity of the foregoing assumptions is provided by the following 
classical theorem (Ref. [11], Chap. 1, Theorems 2.3 and 7.1): 

EXISTENCE, UNIQUENESS, AND CONTINUITY THE- 
OREM. Let f(x, t) be continuous in x, t, and satisfy a Lipschitz 
condition in some region about any Xo, to: 


I|x — xl] b(x0) 

i.e., the following condition is satisfied for (x, t). (y, t) in R(xo, te) 
I|fKx, t) — Ky, — yl] 


where k is a positive constant which depends only on b, c. THEN 
1 There exists a unique solution (t; xo, &) of (5-F) starting at 
xo, &, for all |t — to| < a(t), 


Min{c(te), b(x0)/M(xe, te)} 


where M(xo, te) is the marimum asrumed by the continuous function 
||f(x, t)]| in the closed, bounded set R(xo, te). 

2 In some small neighborhood of xo, to the solution is a continu- 
ous function of its arguments. 

Note that the Lipschitz condition implies continuity of f in x, 
but not necessarily in ¢; it is implied by bounded partial deriva- 
tives in x. It should be borne in mind that the local Lipschitz 
condition required by the theorem only implies the desired prop- 
erties of a solution near xo, fe. From it we cannot conclude the 


&) = (b, ¢ > 0) 
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existence of solutions for arbitrarily large values of t as the follow- 
ing example shows: 
Example 3. Finite Escape Time. 


Consider the first-order non- 
linear differential equation 


= 2x? 
The requirements of the existence and uniqueness theorem are 
obviously satisfied 
By separation of variables, the solution is seen to be 


1 
gilt; Za, &) 


—=t— t 
To 
Now if zo > 0, the solution will increase with t; therefore $,(t; 
Tu, te) tends to infinity as ¢ tends to t} = t + 1/zm. As ¢ in- 
creases beyond 4, the original solution which started at to, zo, > 0 
cannot be continued, except possibly by letting the state 2;(t,) 
“jump” from + © to —, which is clearly not the way physical 
systems behave. 
Following Markus [17], one says in this case that the solution 
of the differential equation escapes to infinity in finite time or has 
, finite escape time. (This can be always avoided by replacing ¢ by 
‘¢' = t/(t. — t).) In order that a differential equation represent 
a physical system, this possibility must be ruled out by explicit as- 
sumption to the contrary. 
If the Lipschitz condition holds for f everywhere, there can be no 


finite escape time. To see this (it does not follow directly from 
the theorem!), let 


I|f(x, t) — Hy, < — 
for all x, y, t. 
Integrating both sides of (5-F) and using this, and (8-A) 


: This is easily seen to imply, by the well-known Gronwall-Bellman 
lemma (Reference [49], p. 35), that 


\|(t; xo, to)|] < [exp k(t — x] 


which is less than infinity for any finite (¢ — to). 

However, for some purposes the requirement of a global Lip- 
schitz condition is too restrictive; for instance, in case of servos 
with ramp inputs. 

If the foregoing hypotheses concerning @ are satisfied, the dif- 
ferential equation (5) is said to represent a continuous-time dy- 
namic system. We then call @(t; xo, to) a motion (or trajectory or 
transition function) of the dynamic system. Frequently, it is 
convenient to define a dynamic system axiomatically by means 


(6) 


of its motions satisfying the foregoing hypotheses [10]. These 
axioms may be stated as follows: 
(Ai) (i) x0, tc) = xo forall xe, to (7) 


(ii) xo, to), 4) 
= (ts; Xo, to) for all xo, te, tr (8) 
(existence and uniqueness; see Fig. 3.) 


(As) ts continuous with respect to all arguments. 
(As) © is defined for all x, to, t (no finite escape time). 


A state x, of a free dynamic system (5-F) is an equilibrium state 
if 


f(x,; = 0 forall ¢ 
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or, equivalently, 


o(t; x,,0) =x, forall ¢ 


In other words, a motion passing through an equilibrium state at 
any time is actually at the same state at all times. 
A dynamic system (5) is stationary if 


F(x, u(t), t) = f(x, u(t)) for all x, u(t) 


i.e., if f does not depend explicitly on time (but may depend im- 
plicitly on time through u(t)). Fora free system, this is equivalent 


to saying that every motion is invariant under translation in time, 
i.e., that 


xo, to) = O(t + 7; mo, & + 7) 


for any xo, &, 7. A system which is both free and stationary is 
said to be autonomous. 
A dynamic system (5) is linear if f is a linear function of x and 


u. Assuming for simplicity that 0 is an equilibrium state, i.e., 
that 


(0,0,t) =0 (8A) 


it follows that a linear system (5) can be written in the form 


dx/dt = F(t)x + D(t)u(t) (5-L) 
where 
F(t) = 
D(t) = [df(t)/du,] 
aren X n resp. n X m matrices depending on time. In a sta- 


tionary linear system F and D are constant matrices. 
Fort = to, the general solution of (5-L) can be written as [11] 


P(t; xo, toc) = D(t, + @(t,7) D(r)u(r)dr (9) 


where @(t, to) is called the transition matriz of the system (5-L). 
Its elements may be regarded as the impulse responses under 
appropriate excitations and observations [12, 13]. Thus $,,(t, 
to) is the response of the ith state variable to an excitation de- 
fined by (taking D(t) =1): 


u(t) = — &); ult) = 0 
with x= 0. 


As a consequence of linearity, relations (7) and (8) now imply 
(set u(t) = 0 in (9)!) 


P(t, to) = 1 


when k ¥j 


for all t 


(7-L) 
and 


P(t, to) = D(b, P(t, &) forall ty, hy, te (8-L) 


x 
10) 
Fig. 3 Notation for motions of a dynamic system 
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From this, it follows at once also that 
®D-(t, = P(t, t) for all t, (9) 


Another useful property of the transition matrix, an immediate 
consequence of its definition by (9), is that it satisfies its own 
differential equation: 


AD(t, to)/dt = F(ty@(t, to) for allt, & 


In a stationary linear system @(t, 4) depends only on the dif- 
ference t — t. In fact, it is easy to show [5, 11] that 


(10)s 


P(t, &) = ((t — to) = exp(t—%)F (11) 
=0 


As usually stated, the “‘second method’’ of Lyapunov applies 
only to free dynamic systems, with equilibrium state at 0; 
that is to say, to deviations about some fixed motion. To see this, 
let y(t) be some fixed motion of (5), corresponding to a fixed 
choice of u(t). Let 


x= y(i)+2z 
Then 
dx/dt = dy(t)/dt + dz/dt 
= Ky(t) + 2, u(t), t) 
= y(t), u(t), t) + t) 


Since y(t) is a solution of (5), it follows that 
dz/dt = t), 
9(0,t) = forall? 


so that, for fixed u(t), deviations from the motion y(t) are a free 
dynamic system. 

The description of dynamic systems via the concept of ‘‘state’’ 
(instead of more restricted concepts like impulse response, fre- 
quency response, transfer function, etc., which are invalid for 
nonlinear and sometimes even for nonstationary linear systems) 
is essential for the understanding of the “‘second method.”’ In 
fact, this is necessary even for a precise definition of stability. 
Though not yet in common use, the “‘state’’ method has been 
applied recently to numerous control problems; see references 
{12-16} which also contain much pedagogical material in regard 
to setting up of equations, interpretation of solutions in geometric 
terms, etc. The idea of ‘‘state’’ is well known, of course; it has 
been used in mathematics more or less explicitly at least since 
Poincaré. 


5—Concepts of Stability 


On closer examination, the intuitive idea of stability turns out 
to be very complex. This is particularly true in nonlinear dy- 
namic systems. As a result, a large variety of precise definitions 
of stability have been proposed, often differing in subtle ways 
which are confusing to the uninitiated. Here we shall present the 
principal definitions used at present and discuss their relationships 
to one another. 

The question of stability concerns deviation about some fized 
motion. By the remarks at the end of the previous section, it 
follows that it suffices to consider deviations from an equilibrium 
state x, of a free dynamic system. 

Perhaps the oldest concept of stability is this: If the system 


5 In the literature [5, 11], any nonsingular matrix function of time 
satisfying (10) is called a fundamental matriz of (5-L). In our case, 
the transition matrix is a fundamental matrix made unique through 
the requirement (7-L). 
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(5-F) is perturbed slightly from its equilibrium state at the origin, 
all subsequent motions remain in a correspondingly small neigh- 
borhood of the origin. For instance, harmonic oscillators (Ex- 
ample 1) possess this type of stability. The precise definition goes 
back to Lyapunov: 


(S:) An equilibrium state x, of a free dynamic system is stable 
if for every real number € > 0 there exists a real number 4(€, to) > 0 
such that ||xo — x,|| < 4 implies 


xo, &) — x,|] € for allt = t& 


Fig. 4 illustrates the definition. Note that stability in the 
sense of Lyapunov (definition (S,)) is a local concept; it refers to 
behavior near the equilibrium state since one does not know a 
priori how small 6 in the definition may have to be chosen. 

Observe further that if there is stability for some initial time ty 
there is stability for any other initial time t,, provided all motions 
are continuous in the initial state (Axiom (A¢)). 

In fact, if t; < t define (possible, since @ is continuous in t/) 


Q(x, 4, = Max x, t:) — x,!| (12) 


This function is clearly 0 when x, = x,; elementary but some- 
what tedious arguments show that, due to (A:), Q is con- 
tinuous in x;. Hence there is some v(d(€, t), 1) > 0 such that 
— x,!| v implies |(t; x, %&) — x,|| < the interval 
4 StS t&. By stability at &, it follows that ||@(t; =, 4) — 
x,!| < eforallt = We can take d(€,t) = v. Ift S th, merely 
let Q = (to; x, 

In most engineering applications, one is interested in a stronger 
kind of stability; namely, the motion should return to equi- 
librium after any small perturbation. The classical definition of 
Lyapunov is the following: 


(S:) An equilibrium state x, of a free dynamic system is 
asymptotically stable if 

(i) it is stable and 

(ii) every motion starting sufficiently near x, converges to x, 
ast-» . In other words, there is some real constant r(4) > 0 
and to every real number yu > 0 there corresponds a real number 
xo, to) such that — < r(é) implies 


xo, — xJ! wforallt > 


See Fig. 5. Asymptotic stability is also a local concept; one 
does not know a priori how small r(t) may have to be. If a free 
linear system is asymptotically stable, then ||x, — x,!! < || xo - 
implies that =, to) < xo, to) for any and &, since 
the initial state is factored out in @ (see (9)). In other words, 
among motions starting at the same distance from x, none will 
remain at a distance larger than u from x, at arbitrarily large val- 
ues of time. This is expressed by the following definition due to 
Massera [18]: 


(S,;) An equilibrium state x, of a free dynamic system is 
equiasymptotically stable if 

(i) it is stable and 

(ii) every motion starting sufficiently near x, converges to x, as 
t— © uniformly in x. In other words, 7 in definition (S,-ii) 
is of the form Tu, r(to), to). 


A somewhat puzzling aesthetic deficiency of definitions (8; — 
8.) is that one has to assume erplicitly that (S,) holds. The fol- 
lowing example shows that (S.-ii) does not imply (S,) or (8,-ii) 
when n > 1. 

Example. Consider the second-order system in polar co-ordi- 
nates x = (r,9),0 r< ~,0 < 2n: 


= [9(0, t)/9(8, 
6=0 
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Fig. 4 Definition of stability 


where 


sin? 0 1 


t) = sin‘ + (1 — ¢sin* 6)? 7% +? 


Then 


r(t; To t) 19( 9, t)/g( to)] ro 
A(t; ro, 9, = 


The motion is continuous with respect to ro, %. (Ss-ii) is 
satisfied. However, (S;), (S;-ii) do not hold because at ¢; = 
sin 89, t:) > sin~? 0, which can be made arbitrarily large as 
6.— +7. Here the function T(p, xo, 4) is not continuous in x» 
along the ray 0 = +7. 

On the other hand, (S,-ii) implies (S,) if all motions are con- 
tinuous tn xo. 

In fact, let : = t + T(€, r(to), &) and define 


A(x, fo, T) = Max xo, &) — x,| 
&stsh 
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Fig.6 Interrelations between stability concepts 


Fig. 5 Definition of asymptotic stability 


Since T does not depend on xo by (S;-ii), it follows by (Az), as in 
(12), that A is continuous in xo. Hence there is some 8(€, t&) such 
that ||x0 — x,|| < 4 implies 


l(t; xo, &) — S forall 


By (S;-ii), the inequality then holds for allt = t. 

(S:) obviously implies (S:). (S:) is equivalent to (Ss) if the dy- 
namic system is of the first order [18]. In fact, ifz,<y<2z 
(scalars), then for all t, to O(t; y, to) < @ (t; 2, to), for otherwise 
by continuity in ¢ one would have z = $(4; 2, t) = O(h; y, tbr) 
which says that the motion starting at z, 4; may be either at z or y 
at &, contradicting axiom (A;). Therefore, if 


|x| < r(to), T(u, 20, to) S Max T(u, + r(to), to) 
Another important property of linear systems is that stability 


is independent of the distance of the initial state from x,. This 
is expressed by 
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(S,) an equilibrium state x, of a free dynamic system is 
asymptotically [(S,): equiasymptotically) stable in the large if 

(i) it is stable and 

(ii) every motion converges to x, [converges to x, uniformly in 
xo for ||xo|| < r where r is fixed but arbitrarily large] ast —> ©. 


If a system is linear (S,-S;) are equivalent. 

A motion is said to be bounded for every xo, to if there is some 
constant such that ||(t; xo, &)/| B for allt = &. 
The motion is eguibounded if B(xo, 6) S B(r, te) for all || nol] sr. 
These properties are implied by asymptotic resp. equiasymptotic 
stability in the large as is seen from the proof that (S,-ii) implies 
(S,). 

If a system is autonomous (= free + stationary), then 6, r, T 
in the preceding definitions do not depend on &. It is not hard to 
show [2] that the same holds also if the function f(x, ¢) is periodic 
in t. There are systems which have this property without being 
autonomous or periodic. This motivates the various definitions 
of uniform (in time) stability. 

(Se) Uniform stability is stability such that 6 does not depend 
on by. 

It turns out that asymplotic and equiasymptotic stability are 
equivalent if they are uniform in to. 

In fact, take any u > 0. Since the foregoing hypotheses imply 
(Se), there is some corresponding 6(u) > 0. Consider any x in 
the region |/x — x,|| < r. By uniform (in &) convergence, find a 
7(6/2, x) such that 


lp(t; x, &) — 6/2 


By continuity with respect to the initial state, there is some 
neighborhood U(x) of x such that if y is in U(x) then 


b(t + 7; — <3 


so that, because of (S.), the motion starting at y remains within 
distance of at most yp of x, for allt = t& + T. 

Since ||x — x,|| < r is compact, it is covered by definition by 
a finite number of U(x), say, U(x), ..., U(xy). Hence define 


r) = Max {7(6/2, x,)} < 


which completes the proof. 

The foregoing argument leads to a finite value of T(y, r, &) if r 
and 7’ are dependent on &. Then we can conclude: (S,-ii) to- 
gether with implies (Ss). 

These observations motivate the following definition, which is 
standard: 

(S:) Uniform asymptotic stability is equiasymptotic stability 
such that 6, r, 7 do not depend on &. 

It is clear that uniform asymptotic stability involves the ex- 
plicit assumption of uniform stability. Unlike in the case of non- 
uniform stability, where (S,-ii) implies (S,), this assumption can- 
not be dispensed with as the following example due to Massera 
shows: 


Example 4. Impulse Response With Growing Peaks. Consider 
the first-order linear system 


(4t sin t — 


By separation of variables and integrating, the impulse response 
of this system (=1 X 1 transition matrix) is 


= exp (4 sin t — 4t cost — 
— 4sin + cos + t*) 


which is less than 


exp [4(2 + + — 
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which shows that |||) — 0 with t+ © uniformly in & (and of 
course uniformly in | xo!! < r, by linearity.) However, 


du((2n + = exp — + 1)] 


The “peaks” of the impulse response increase indefinitely as 
There isnouniform stability; in fact, the motions are not 
even uniformly bounded. 

This example motivates also the following definition: 


(Ss) An equilibrium state x, of a free dynamic system is 
uniformly asymptotically stable in the large if 

(i) it is uniformly stable, 

(ii) it is uniformly bounded, i.e., given any r > 0 there is some 
B(r) such that — r implies ||@(t; xo, &) — < B 
for allt = b, 

(iii) every motion converges to x, as -—> © uniformly in & and 
| xo!! Sr, when r is fixed but arbitrarily large; i.e., given any r > 0 
and ys > 0 there is some 7(y, r) such that |/xo — x;|| < r implies 
l(t; xo, to.) — x,|] S wfor allt 20% + 7. 


One can define a type of boundedness by taking B(x); how- 
ever, this condition, with (S,-iii), would imply (S,-ii), by argu- 
ments similar to those concerning uniform asymptotic stability. 

By a slight extension of Example 4, it follows that (S--iii) does 
not imply (S,) in general. A case where this is true occurs when 
f(x, ¢) satisfies a uniform Lipschitz condition, i.e., 


|x, t) — Ky, Ol] < — 


where k is a constant independent of x, y, or t. To prove this 
assertion, one merely makes use of the relation (6). 

The different types of stability, summarizing the discussion of 
this section, are displayed in Fig. 6. 

It is apparent that concepts of stability are closely related to 
concepts of convergence. When there are many of the latter, 
there are correspondingly many types of stability. For instance, 
in the study of stochastic systems, one can talk about stability in 
probability, stability in mean, almost sure stability, etc. It is 
not hard in principle to apply the “second method” to such 
problems as has been done by Bertram and Sarachik [19]. 


6—The Main Theorems 


The application of the “second method” to stability problems 
consists of defining a Lyapunov function with appropriate prop- 
erties whose existence implies the desired type of stability. We 
shall consider uniform asymptotic stability in the large and de- 
fine the class of Lyapunov functions for this case. By weakening 
the various requirements on Lyapunov functions, we obtain other 
stability theorems as a by-product. See also Massera [2], Hahn 
([3], and Malkin [4]. 

Theorem 1. (Lyapunov) Consider the continuous-time, free dy- 
namic system 


dx/dt = (x, t) (5-F) 
where (0, t) = O for all t. 

SUPPOSE there exists a scalar function V(x, t) with continuous 
Jirst partial derivatives with respect to x and t such that V(0, t) = 0 
and 

(i) V(x, t) is positive definite; i.e., there exists a continuous, non- 


decreasing scalar function a such that a(0) = 0 and, for all t and 
allx #0 


0 < < V(x, 0) 
(see Fig. 7); 
(ii) There exists a continuous scalar function yy such that y(0) = 0 


and the derivative V of V along the motion starting at t, x satisfies, for 
all t and all x # 0, 
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‘ ae 


dV(x, 


along motion 
dt att,x 


_ dV(G(7; xo, t), 
dr 


Vix, j= 


r=t 


[V(x + hf(x, t),¢ +h) — V(x, t)]/h 


= OV /dt + (grad V)’F(x, 


— < 08; 


/ 


(iti) There exists a continuous, nondecreasing scalar function such 

that B(0) = 0 and, for all t, 
V(x, t) Bi 

(iv) a(||x\|) © with ||x|]| 

THEN the equilibrium state x, = 0 is uniformly asymptotically 
stable in the large; V(x, t) is called a Lyapunov function of the 
system (5-F). 

Corollary 1.1. The following conditions are sufficient for the 
various weaker types of stability: 

(a) Uniform asymptotic stability: (i-iii). 

(b) Equiasymptotic stability in the large: (i-ii), (iv). 

(c) Equiasymptotic stability: (i-ii). . 

(d) Uniform stability: (i), (iii), and (ii): V(x, t) S 0 for all x, t. 

(e) Stability: (i-ii). 

(f) No finite escape time: (i), (iv), and (iig): V(x, t) S cV(x, t) 
for all x, t; c being a positive constant. 


Corollary 1.2. For a continuous-time autonomous dynamic sys- 
tem 


dx/dt = (x) (5-A) 


(0) = O, equiasymptotic stability in the large is assured by the 
existence of scalar function V(x) with continuous first partial deriva- 
tives with respect to x, such that V(O0) = O and 

(i-A) V(x) > 0 for allx ~ 0 

(ii-A) V(x) < 0 for allx #0 

(iv-A) V(x) © with ||x|| > © 


Corollary 1.3. Jn Corollary 1.2, Condition (ii-A) may be replaced 
by: 

(iir-A) V(x) <0 for all x 

(ii-A) V((t; xo, to)) does not vanish identically in t = to for 
any to and any x = 0. 


Proof of Theorem 1. Using (ii) we see that V is decreasing along 
any motion: 


V(t; Xo, to), t) = V(x0, to) 
= 6,4), <0, t> (14) 


(a) To prove uniform stability, consider any «€ > 0. Take 
6(€) > O such that 6(6) < a(€)—this is possible because 8 is con- 
tinuous and 0 at 0; see Fig. 7. Then if |x! < 6, t& being arbi- 
trary, we have using (ii) and (14) (see Fig. 8), for allt = & 


a(e) > B(6) => to) 
V((t; xo, to), t) 
a(||(t; xo, 


® This is slightly weaker than the usual requirement that V be 
negative definite, since we do not need the hypothesis that y is non- 
decreasing. 

7 In the literature, particularly the Russian, this condition is some- 
times called: ‘‘V has an infinitely small upper bound.” 
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But since @ is nondecreasing and positive, this implies 


for arbitrary &, which is what was to be proved. Note that (iv) 
is not needed and that it is sufficient to have V < 0. 

(b) Now we prove that ||¢(t; xo, to)|| > 0 with t—> © uniformly 
in t and ||xol| 

Take any positive constant c, and find an r > 0 satisfying B(r) 
<a(c,). Take any initial state |! xo! sr. By part (a) of the proof, 
then |/q(t; xo, to)|| < for allt = t, to being arbitrary. 

Now take any 0 < |} Find a > such that 
< a(yu). Denote by c(u, r) > 0 the minimum of the continuous 
function ||) on the compact set »(u) < < a(r). Define 
T(u, = B(r)/cxu, r) > 0. 

Suppose now that l(t; Xo, to)|| > vover the intervalh St st 
= t+ 7. Again by (ii) and (14), we have 


0 < a(v) s Xo, lo), 
S V(xo, to) — (4 — 
— Tc, = 0 


which is a contradiction. Hence for some / in the interval &, t, 
say we have ||x2|| = Xo, to)! =v, Therefore 


S V(xz, te) 


B(v) < 


B (il xl) 


a 


| 
1 


8 (e) 
Fig.7 Definition of V, a, 8 


Vix) = B(8) 


~V(x) = a(e) 


a(il x(t)il) < Vix(t)) ¢ Vix s B(8)<ale) 


Fig. 8 Proof of asymptotic stability 


Transactions of the ASME 


; (13) : 
= 
| vi x,t) 
4, | 
~ 
Af 
\\ 
| 
/ 
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for allt 2 & Hence 
x0, to)|| forallt > + 7T(u,r) 


which proves uniform asymptotic stability. 

(c) To prove uniform asymptotic stability in the large, i.e., 
that the constant r can be chosen to be arbitrarily large, observe 
that by (iv) there exists for any r a constant ¢(r) such that B(r) 
< alec; ). 

Moreover, uniform boundedness is automatic, on setting B(r) 
= o(r). Q.E.D. 

Proof of Corollary 1.1. (a) This is contained in the proof of 
Theorem 1. 

(b) Let 8*(||x!|, ¢) be the maximum of V(y, ¢) for || y|! < ||x!]. 
Replacing by 2), the proof of Theorem 1 proceeds 
as before, but now 6 and T are also functions of t. 


(c) Use 8* instead of 8 and omit Part (c) of the proof of 
Theorem 1. 

(d) Part (a) of the proof of Theorem 1. 

(e) Use 8* defined previously and proceed as in Part (a) of the 
proof of Theorem 1. Then 4 is also a function of t. 

(f) An easy calculation shows that (ii,) implies 


ViGlt; mo, to)) V(mo, to) 
and by (i) 
Because of (iv), ||x|| = © implies a(/|x|!) = © which is impossible 
for any finite t. Q.E.D. 
Proof of Corollary 1.2. By (iv), given any a > O there is a 


b(a) > O such that V(x) > a when |'x!| > 6. (If (iv-A) is missing 
and only local stability is of interest, pick b > 0 arbitrarily.) Let 


e(||x!!) = {Min V(y); ly || = 


and then define 


a(||x|]) = Min {V(y); {lxl] < lly! < 
y 

y 

= Min {—V(y); |lyl] = 
y 


The functions a, 8, y obviously satisfy the hypotheses of 
Theorem 1. Q.E.D. 

Proof of Corollary 1.3. It suffices to show that (ii-A), (ii,-A), to- 
gether with (i-A), (iv-A) imply asymptotic stability in the large 
for stationary systems. By (iv-A) and (ii,-A), for every x» there is 
a constant such that xo, fo)|| S c for allt = t, to 
being arbitrary since the system is stationary. 

Suppose ||(t; xo, 2» > 0. By (i-A) and (ii-A) the in- 
tegral 


—V(m) < x <0 


is nonincreasing and bounded as t — ©, which implies that V is 
identically zero on the positive limit set [11] of x, which is con- 
tained in the region v S ||x|! < «. Since the limit set is itself a 
motion, this contradicts (ii-A). Hence every motion will 
eventually enter the set |!x|| < v. But if v is chosen so that B(v) < 
a(u), then asymptotic stability in the large follows, as in the 
proof of Theorem 1. Q.E.D. 

As already pointed out, there is no explicit prescription in gen- 
eral for finding Lyapunov functions satisfying the requirements of 
Theorem 1 or of its corollaries. For this reason, it is very im- 
portant to know when a Lyapunov function with certain proper- 
ties exists, so that the search for one should not be in vain! It 
turns out that this question is closely related to the concept of the 
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uniform asymptotic stability. We now quote what is at present 
the best available result along these lines [2]: 

Theorem 2. (Massera) Let the function t defining the differential 
equation (5-F) be Lipschitzian. Assume also that (0, t) = 0 
and that the equilibrium state xe = 0 is uniformly asymptotically 
stable in the large. 

Then there exists a Lyapunov function V(x, t) which is infinitely 
differentiable with respect to x, t, and satisfies all of the hypotheses of 
Theorem 1. 

Thus the existence of a Lyapunov function as required by 
Theorem 1 is necessary and sufficient for uniform asymptotic 
stability in the large. 

PROOF: See Reference [2], Theorem 23*. 

The stability problem in nonstationary linear systems is one of 
considerable subtlety. Roughly speaking, the requirement of 
“uniformity” must be present in the formal definitions of stabil- 
ity, in order that it should coincide with the intuitive concept of 
stability. The troubles are illustrated by the following example: 

Example 5. Stable System With Unbounded Step Response. 
Consider the nonstationary linear system 


+ = w(t) 


for values of 0 < & < t. (For values of & below 0, the equation 
has finite escape time (see Example 3), and therefore does not 
define a dynamic system.) The impulse response (which in this 
case is also the 1 < ! transition matrix) of this system is 


gull, bb) = b/t; 


The system is evidently asymptotically stable, but not uni- 
formly asymptotically stable: If t >Oand T = (t — t) is a fixed 
constant then 


Oult, to.) = b/(T + th) > */2 


for & sufficiently large, no matter how large T is. 

Now suppose that the initial state at some & > 0 is zero. Let 
the input y:(/) be a unit step function occurring at ¢ = & Then 
the unit step response is given by 


dult, r)dr 


= (t — &?/t)/2 


which tends to infinity with ¢/ 

The difficulty here is due to the fact that the impulse response 
does not tend to 0 exponentially as in an asymptotically stable 
linear stationary system. It so happens that, in the linear case, 
exponential convergence to equilibrium is equivalent to uniform 
asymptotic stability. The latter, in turn, is equivalent to the 
existence of a Lyapunov function or the fact that every bounded 
excitation produces a bounded response. These facts are sum- 
marized by: 

Theorem 3. (Lyapunov-Perron-Malkin) Consider a continuous- 
time, linear dynamic system 


dx/dt = F(t)x + D(t)u(t) (5-L) 
subject to the restrictions 
(i) || S at < forallt 
(ii) O < < < for all ||x|| = 1, allt 


THEN the following propositions concerning this system are 
equivalent: 


(A) Any uniformly bounded excitation 
llu(t)|] (t 2b) 


8 In this theorem and its proof, c:, c2, . . . are used to denote various 
fixed positive constants. We continue to use this convention in the 
sequel. 
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gives rise to a uniformly bounded response for allt = to 


S < ©; (14-A) 


(B) For allt = to, r)\|dr S < 


(C) The equilibrium state xe = 0 of the free system is uniformly 
asymptotically stable; 
(D) There are positive constants cz, cs such that, whenever t 2 ty, 


l@(t, < (15) 


(E) Given any positive-definite matrix Q(t) continuous in t and 
satisfying for allt = t 


0< col s A(t) S cyl < (16) 


the scalar function defined by 


= (18) 


exists"! and is a Lyapunov function for the free system satisfying the 
requirements of Theorem 1, with its derivative along the free motion 
starting at x, t being 


V(x, t) = 


Propositions (A )-(D) relate to different abstract notions of 
stability. The equivalence of (A) and (B) was first proved by 
Perron [20]. Zadeh used (A) as his basic definition of stability 
{21}. By proposition (£), the preceding notions of stability are 
equivalent to the existence of a Lyapunov function. Malkin [22] 
was the first to prove that (C) is equivalent to (EZ); his method of 
proof implied condition (B), hence, by Perron’s result, the proof 
of the entire theorem. See also Antosiewiez and Davis [23]. 
The proof given below, based on unpublished work of Kalman. is 
much simpler than the original proof of Malkin and Perron, due 
to the systematic use of the transition matrix. 

None of the propositions in Theorem 3 offers a method of de- 
termining whether a given system is uniformly asymptotically 
stable or not without computing its transition matrix for all t, t. 
Such a method is known so far only in the case of linear stationary 
systems and is already contained in Lyapunov’s original memoire: 

Corollary 3.1. (Lyapunov) The equilibrium state x, = 0 of a 
continuous-time, free, linear, stationary dynamic system 


dx/dt = Fx (5-FLS) 
is asymptotically stable (a) if and (b) only if given any symmetric, 
positive-definite matrix Q there exists a symmetric, positive-definite 
matrix P which is the unique solution of the set of n(n + 1)/2'* 
linear equations 
F'P+PF = —-Q (19) 

and ||x||2p is a Lyapunov function for (5-FLS). 

A slight extension of this result is useful for some purposes: 

Corollary 3.2. (Kalman) The real parts of the eigenvalues of a 
(constant) matrix F are < o if and only if given any symmetric, 
positive-de finite matrix Q there exists a symmetric, positive-definite 


*If A, B are two matrices, we write A < B[A 3S B] to express the 
fact that B — A is positive-definite [semidefinite]. 

1” We write (see Appendix) x’Ax =||x|/?4 where A is a symmetric, 
positive-definite matrix. 

11 T.e., the integral defined by (17) is finite for all finite values of x 
andt. 


12 Since P is symmetric, it has precisely this many unknown ele- 
ments in (19). 
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matrix P which is the unique solution of the set of n(n + 1)/2 linear 
equations. 


—2oP + FP+PF=-Q 


Proof of Theorem 3. (Kalman) The proof consists mainly of 
using the multiplicative property (8-L) of transition matrices and 
various technicalities involving norms and _ positive-definite 
matrices. 

To show that (A) implies (B). If (B) did not hold, for at least 
one pair (7*, j*) of subscripts the integral 


must tend to @ as (t — By (ii), 1/e, < < 1/e2 
forallr. Therefore define, forall <7 st 


(20) 


where 

if z=0 
-1 if r<0 


sgnz = 0 (21) 


Then u(r) is bounded; 
(75), with xo = 0, 


we can take cq = 1/ce:. Using this and 


> ele, 0) 2 x(2)}| 


n 


n t 
| te | 


= r)\dr 
to 


By assumption, the last integral tends to infinity with (¢ — &), 
which is a contradiction. 
To show that (B) implies (C). (This is the only nontrivial part 
of the proof.) First we show that (B) implies 
|@(¢, &)|| < ou < 


By (i) and (B) we can write 


(t 2 b) (22) 


fe 


Since the transition matrix satisfies its own differential equation 
(10), we obtain, after using repeatedly (8-L) 


t) = 1) F(r) = r) F(r) 


o> ccs 2 If. 


Therefore 


= 


which implies (22). 
Using (B), (22), and (8-L), 
= w(t 6), 
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Letting 


r) = = 


it follows that 
to)xo|! 


when ||xo|| < r and¢ 2 T + &, so that we have uniform asymp- 
totic stability. 

To show that (C) implies (D). Since uniform asymptotic sta- 
bility implies uniform stability (see definitions (8;), (Ss)), the 
transition matrix is uniformly bounded. In fact, take any € > 0, 
0 < ||xol| < d(€). By linearity: 


= 
By uniform asymptotic stability, let 7 = 7(2~', 1) so that 


independently of & By induction and (77) 


+ kT, + kT, + (k — 1)T)I|... 


+ T, 
< 


Hence if cs = (log 2)/T > 0. then 
To show that (D) implies (A). By (15), (ii), and (78) 


| t 1 


t 


so that (14-A) holds. 

By Theorem 1, (£) implies (C). 

To show that (D) implies (Z). This is essentially Theorem 2; 
we give an independent proof. Proceeding as above and using 
(16) 

By (15) 
= Billxll) < (23) 
Further, using (i), (16), and (10) 


The derivative V of the Lyapunov function V along the free 
motion starting at t, x is obtained by direct computation 


V(x, t) = lim [Vid(t +h; x,t), t+ h) 
h—~oh 


V(x, t)} 


t+h 
= lim — f | (24) 
t 


when x #0, 


This completes the proof of Theorem 3. 
Proof of Corollary 3.1. (a) Let P > O satisfy (19). Define 


V(x) = = x’Px>0O whenx «0 
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as a tentative Lyapunov function for system (5-FLS). Then, by 
(13) 


Vix) = (grad V)/Fx = 2(Px)’Fx = F’P + PF)x (25) 
and by (19) 
V(x) = —x’Qx = < 0 when x ~ 0 


so that V satisfies the hypotheses of Corollary 1.2; therefore (5- 
FLS) is asymptotically stable. 

(b) Suppose (5-FLS) is asymptotically stable. Since it is 
stationary, it is then also uniformly asymptotically stable. There- 
fore, by Theorem 3 (C) and (£), the matrix P(t) defined by (18) 
exists. Assuming Q(t) = Q = const it follows then by the 
stationarity of (5-F LS) that P(t) = P = const. Also, the deriva- 
tive of the Lyapunov function V(x) defined by (18) is —x’Qx, 
which is equal to (25), so that P satisfies (19). 

To show that P defined by (18) is the unique solution of (19) 
let P be any solution of (19). Then, by the definition (11) of the 
transition matrix in the stationary case 


P= ff (exp F’r)Q(exp Fr)dr 


(exp F’r)( + Fr)dr 


Since F commutes with exp Fr for all r 


P = [(exp F’r)P(exp Fr) jdr 


ie 
= —(exp F’r)P (exp Fr) = p 


the last equality following from asymptotic stability. Q.E.D. 
Incidentally, Bellman [24] proves that (19) has a unique (but 
not necessarily positive definite) solution for any Q whenever 
ACF) + AX F) 0 for all i, j. 
Proof of Corollary 3.2. The assumptions on F require that 


F)<o 


From the characteristic equation of a matrix it follows that this is 
equivalent to: 


F — al) <0 


Replacing F in (19) by F — ol yields (20). (Notice that this 
argument is equivalent to the “‘shift’’ theorem in the theory of the 
Laplace transform.) Q.E.D. 


7—Applications to Stability Theory 


7.1 Routh-Hurwitz Conditions for Linear Systems. 
Analogously to the various methods associated with the names of 
Routh and Hurwitz, Corollary 3.1 provides a purely algebraic 
procedure [25] for testing whether a given free, linear, stationary 
system is asymptotically stable: 


(a) Take an arbitrary symmetric, positive-definite matrix Q 
(say, the unit matrix); 

(b) Solve (19) for the unknown components of the symmetric 
matrix P. This is a set of n(n + 1)/2 linear equations in the un- 
known elements pu, . . ., Pin} P22, +) Dani Pan OfP. (The 
requirement that this set of equations have a nonzero determinant 
is implied by the Routh-Hurwitz inequalities. ) 

(c) Check to see if P is positive-definite. It suffices to check 
if all leading principal minors" of P are positive. This supplies 
the Routh-Hurwitz inequalities. 


4 T.e., the determinants of the following submatrices of P: (1) P 
itself, (2) P with the nth row and column deleted, . . ., (nm) the 1-1 
matrix consisting of pi. 
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This procedure can be used to derive the Routh-Hurwitz in- 
equalities as in Example 6. However, this is mainly of theoretical 
interest since the derivations are very complicated when the 
matrix F contains more than about 4 or 5 nonzero coefficients. 

The principal significance of this procedure is that it provides 
a convenient and efficient algorithm to determine stability by 
machine computation. This is far from trivial in the case of 
large-scale systems since the Routh-Hurwitz conditions in their 
usual form apply only to a single, nth-order equation (which has 
in general n different coefficients). Before the Routh-Hurwitz 
conditions can be applied to a general linear system of n first-order 
equations (which in general has n?* different coefficients), it is 
necessary to compute the characteristic equation of the matrix F; 
this is far from a trivial job when the number of nonzero co- 
efficients in F is large. These difficulties are avoided when (19) is 
used. Another efficient computation procedure for determining 
stability of general linear stationary systems, due to Schwarz 
{26}, reduces F to a canonic form by means of elementary matrix 
transformations. This is also related to the ‘second method” as 
Example 7 shows. 

The concept of a Lyapunov function as in Corollary 3.1 provides 
a convenient link between the Routh-Hurwitz conditions and 
other aspects of control-system theory, such as the integral of the 
quadratic error. This connection, which has come to surface 
only recently [27], answers an oft-heard but superficial dictum in 
elementary courses or textbooks, that “the Routh-Hurwitz 
criteria do not provide a quantitative measure of stability.” 
Much further research is needed to completely elucidate these 
connections. 

Example 6. Routh-Hurwitz Conditions for General Second-Order 
Case. Let Q = I and write out explicitly (19) in the case when 
F is a 2 X 2 matrix: 


fre Piz Piz P22 a fn 0-1 


This system of equations is equivalent to 


fie fu tfe fa = 0 


which has a unique solution if and only if the determinant of the 
matrix F on the left is nonzero: 


det = 4(fun + fo fufee — fifa) 
n>(tr F)(det F) 0 (26) 


Assuming (26), after tedious elementary computations we find 


-1 F + far? + —(fifee + faufu) ] 
— (firfee + fufu) 


2(tr F)(det F) det F + fu? Si? 


(27) 


This matrix is positive definite if and only if 


(fi + fee)? + (fi2 — fu)? 
2(tr F)%(det F) 


det P = 


>0 (28) 


_ _det F + fu* + 
Qtr FXdet F) (29) 


But (28) implies that 


det F = Sufee >0 (30) 
and this fact, with (29), implies that 


tr F = fu + fea <0 (31) 


and (30-31) together imply (26). 
To put these results in a better-known form, let 


F = | (32) 


Then (30) and (31) require that a; > 0, a2 > 0 which are the well- 
known Routh-Hurwitz inequalities for a constant-coefficient 
linear differential equation of second order. 

Example 7. Canonic Form for Linear Stationary Systems. Con- 
sider the system shown in Fig. 9. The infinitesimal state-transi- 
tion matrix F of this system is: 


0 Seer 0 0 


In a little-known paper, Schwarz [26] has proved the following: 

If a real constant matrix F is similar to a matrix of form (33) 
(ie., F = TCT) then the number of eigenvalues \,( F) of F which 
have negative real parts is equal to the number of positive terms in the 
sequence 


provided none of the a; is zero. 
Under mild restrictions Schwarz [26] also gives an algorithm 


for computing C( F) by means of elementary transformations. 
When all the a; are positive, it is easy to show that the system 


dx/dt = Cx (34) 


is asymptotically stable. (The matrix (32) is a special case of 
this!) In fact, let P be the matrix whose elements on the main 
diagonal are given 
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Fig.9 System corresponding to matrix (33) 
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Pig = (¢ = 1,... 


while all other elements are zero. If all the a; are positive, then P 
is clearly positive definite. Let V(x) = ||x!'%p. This function 
clearly satisfies hypotheses (i-A) and (iv-A) of Corollary 1.2. 
Further, a simple computation shows: 


Vix) = x'(grad V) = 2x’C’Px = —2a,7,2 < 0 


which satisfies hypothesis (ii,-A) of Corollary 1.3. To verify hy- 
pothesis (ii,-A), it is necessary to show that Vid(t; xo, &) is not 
identically zero in ¢ for any initial state x» # 0. Now if $,%t; 
Xo, 4) is to be identically zero in ¢t, then by inspection of Fig. 9 it 
follows that d@,(t; xo, &)/dt must be identically zero in ¢. This 
in turn requires that @,-:(t; xo, &) be identically zero in t. Pro- 
ceeding in this fashion, we see that every component of $(t; 
xo, &) must vanish identically in ¢, i.e., xo is an equilibrium state of 
(34). Because 


n = odd | 


n= even 


— 


+ 


det C -{ 


the system (34) has but one equilibrium state which is x, = 0. 
Thus V does not vanish identically along any motion starting at 
x #0. 

7.2 Stability of Linear Nonstationary Systems. No procedure simi- 
lar to Corollary 3.1 is available so far which would permit deter- 
mination of the stability of general linear nonstationary systems 
in an algebraic way. There is little hope that this state of affairs 
will change soon. It is therefore necessary to use ingenuity in 
finding explicit Lyapunov functions. Without discussing this 
matter in detail (see References [28], [29]) let us outline briefly an 
obvious procedure. 

Take a matrix P(t) which is a differentiable function of ¢ subject 
to 


P(t) Sel < @ 


for allt. Then the function 


Vix, t) = 


obviously satisfies hypotheses (i), (ili), (iv) of Theorem 1 (in 
fact, a(||x\|) = = x! If, in addition, 


(35) 


— Q(t) = F’(d)P(t) + P(e) F(t) + P(t) S <0 


then Vix, t) = —||x|aw 


satisfies hypothesis (ii) of Theorem 1, and V defined by (35) is a 
Lyapunov function. 

The problem, of course, is to choose P(t) in such a way that 
Q(t) is positive definite for the largest possible region of parame- 
ters F(t). At first, it is best to fix Q and use Corollary 3.1 to get 
an initial guess for P. 

Example 8. Time-Varying Network. Consider the time-varying 
network shown in Fig. 10. The differential equations governing 
the dynamics are: 


Rit)i, 
2 R(t) 
Lithi, a=Cit)e, 
Lit) Lit) 
“Oy ari, 


Fig. 10 Time-varying network 
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(37) 


a = 
where z; = q = charge on capacitor and z, = W = flux in induc- 


tor. Let 
R(t) + 2L(t)/RWA)C(t) 1 
| 1 
and it follows immediately from (36) that 
2C-t) 0 
| 0 Pw) 
Assuming 


0<ae 
SRI) Su < 


the system (37) is asymptotically stable if for allt 2 & 
<1 + R(L/R* — C/2) + CL/RC — L/R 
0<€ < 1+ RL/R? 


Even in this simple case, it is difficult to get sharp (i.e., 
almost necessary ) sufficient conditions without considering in de- 
tail the functions L(t), C(t), and R(t). 

7.3 Stability of Nonlinear Systems. In this case, even more than 
before, intuition must be used to obtain suitable Lyapunov func- 
tions. There are a number of well-known examples of how to do 
this; knowledge of the examples helps to obtain new Lyapunov 
functions. 

Example 9. Linearization About Equilibrium. It is common usage 
in engineering to consider only small deviations from the operat- 
ing point (i.e., equilibrium state). This is done by expanding f in 
(5-F) in a Taylor series about the operating point and neglecting 
all higher-order terms. 

It is easy to give a rigorous proof of the legitimacy of this pro- 
cedure by means of the second method. Let y = x — x, and 
assume that f in (5-F) is analytic in a neighborhood of x,. In 
that neighborhood f has a Taylor series which can be written as: 


dy/dt = F(x, t) - F(x,, t) + [ F(t) + Gly, t) (x + x,) 
= [F(t) + Gly, Oly 


where F(t) is the Jacobian matrix of f evaluated at x,, ¢; and 
Gy, ¢) is a matrix such that |/G(y, éy!!/!y|! tends to 0 with 
lly! — 0. 

Now suppose that 

(i) |] < for all t 

(ii) The linearized system 


(38) 


dy/dt = F(t)y (39) 


is uniformly asymptotically stable. 

Then by part (£) of Theorem 3 there is a Lyapunov function 
V(y, ¢) for (39). The derivative of this function along motions of 
(38) is 


Vy, t) = + 2y'POGCy, Oy 
By Schwarz’s inequality and (16), 
By (23), (76), and (80), see Appendix, 


(40) 


2% 
Ca 
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~ If we assume further that 
(iii) ||GCy, uniformly in t with ||y|] 0 


then (see Theorem 1) V( y, ¢) will be a Lyapunov function also for 
system (38) in a sufficiently small neighborhood of the origin. 
Therefore we conclude that: 

If the linearized system (39) near equilibrium is uniformly 
asymptotically stable and is a uniformly good approximation to the 
original system near x,, then x, of (38) is (locally) uniformly asymp- 
totically stable. 

This result includes in it a method of considering small nonlinear 
terms in an otherwise linear system. This will be discussed in 
more detail in Example 15. 


RESISTIVE NETWORK 


'n 
- + + 
ems en 


Fig. 11 General RLC network 


Example 10. Passive Nonlinear Network. Consider a network of 
nonlinear inductors, capacitors, and resistors, without internal 
sources, mutual inductances, or ideal transformers. Let the first 
m elements be inductors, the next n-m elements capacitors, the 
rest resistors. See Fig. 11. Let e,,i;(j = 1,...,) be the volt- 
ages across and the currents through the energy storage elements 
with polarities as shown in Fig. 11. Using W to denote flux and q 
to denote charge, the mathematical description of the nonlinear 
inductors and capacitors is 


i; e; dy ,/dt 
e; = = dq,/dt j=m+i1,.. 


state variables z,,..., z,. The stored energy in the system is 
given by 


. n) 


n 


E = V(x) = Zz 


j=1 


e,i dt 


vi n i 
+ > f f(q;)dq; 


j=1 70 


n zi 


j=1 


(41) 


Assuming the network has an equilibrium state at x = 0, ice. 
f{0) = 0 
(41) will be a Lyapunov function if 


(j = 


(2;) > 0, fr J 


Viz) = et,;<0 


j=1 


and 
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The latter condition is always satisfied by passive resistive net- 
works; in other words, by networks containing only resistors 
whose conductance function is i = g(e) such that (i) g(0) = 0, 
(ii) g( —e) = —g(e), (iii) g(e) is monotonically increasing with e. 
See equations (6) and (6+) of Reference [30]. Thus every passive 
network is asymptotically stable in the large. While this fact may 
seem to be obvious physically, the simplest rigorous proof is sup- 
plied by the second method of Lyapunov. 

An interesting and at present largely unsolved problem which 
arises in connection with nonlinear control systems is the fol- 
lowing: Suppose first that all nonlinearities are functions of a 
single variable only. Let each nonlinearity g(£) be replaced 
formally by (“linearized’”’) g({) = K(&)E where either 


= g(&)/E 
or K(§) = dg(&)/d& = 


where K(£) is treated formally as a constant. Assuming sta- 
tionarity, this leads in general to a differential equation of the 
form 


(42) 
(43) 


dx/dt = F(x)x (44) 


Now suppose the matrix F(x) is asymptotically stable for all x. 
Can we conclude from this that the original equation x = f(x) is 
globally asymptotically stable? This has been widely conjec- 
tured in engineering circles for some time (in case of linearization 
(42) by Aizerman [31], in case of linearization (43) by Kalman 
[32]), and is probably true in most cases. However, no rigorous 
proof is available at present. More exactly, it is not known in 
precisely what cases and for precisely what reasons the conjec- 
tures fail. There are, however, a number of interesting partial 
results. 


f(x)) 
Fig. 12 System corresponding to Barbashin's theorem 
Example 11. Stability of a Third-Order Nonlinear System. Barba- 
shin [33, 34] considers the following system (see Fig. 12): 
= % 
de = (45) 
ds = —f(ti) — — | 


where f(0) = g(0) = O and f, g are differentiable functions. 
Using the “second method” in the form of Corollaries 1.2-1.3, 
Barbashin shows: 
The equilibrium state x, = 0 of system (45) is asymptotically 
stable in the large if: 


(i) a > 0 
(ii) f(m)/m1 = > #0 
(iii) 


If we replace f(z:)/2 and f’(2:) by a and by a, then 
the foregoing conditions become: 
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(i') a >O0O 


(iii’) aya, — a > 0 


(ii’) a 


which are just the Routh-Hurwitz conditions for the linear sys- 
tem 


+ Gok, + aid; + aor, = 0 


Notice, however, that conditions (i-iii) above involve both kinds of 
linearizations mentioned in the foregoing. 
Let 


and define as a candidate for a Lyapunov function of (45): 
V(x) = (21) + + + + (46) 


In view of (ii), f’(0) > 0, and F(z,) > 0 when z, + 0. Further, by 
(i) and (iii), it follows that G(z.) > 0 when z, ~ 0. When z, = 0 
(46) is obviously nonnegative and 0 only if z, = z; = 0. When 
zz ~ 0, the first three terms in (46) can be written as 


[2G(22) + + 4 f(z:) — 


The first term in the foregoing is always positive. By (iii) the 
inner integral above is positive, and therefore by (ii) the outer 
integral is nonnegative. Putting such facts together shows that 
V is positive unless x = 0. Because all integrands in question 
are uniformly bounded from below by (ii) and (iii), it follows 
further that V(x) © when @. Hence V satisfies con- 
ditions (i-A), (iii-A) and (iv-A) of Corollary 1.2. After a short cal- 
culation, it is found that 


Vix) = — 


By (iii), V(x) = Oonly ifz, = 0. Using exactly the same argu- 
ment as in Example 7, it follows that V(@(t; x, 0)) does not 
vanish identically along any trajectory starting at x ~ 0. This 
verifies hypotheses (ii;-A), (ii-A) of Corollary 1.3 which com- 
pletes the proof that (46) is a Lyapunov function for (45). 

Perhaps the best general result on stability of nonlinear systems 
(not restricted by dimensionality as in the foregoing example) is 
the following: 

Theorem 4. (Krasovskii [35]) Consider the continuous-time, 
free, stationary dynamic system: 


dx/dt = f(x) 


(0) = 0) 


ASSUME that f has continuous first partial derivatives and that 
its Jacobian matriz F(x) = [df;/dx,) satisfies the condition (€ > 0 is 
an arbitrary constant): 


(i) F(x) = (F(x) + F(x)] < —d <0 


(5-FS) 


THEN the equilibrium state x, = 0 of the system (5-FS) is 
asymptotically stable in the large and 


V(x) = 


is one of its Lyapunov functions. 
PROOF. Computing V, we find that 


Vix) = V) 


j=1 


= 2f'(x) F(x)x 


= <= — <0 
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the last step following by hypothesis (i). Thus V is negative 
definite. 

It remains to show that V is positive definite and tends to @ 
with ||x|| ©. 

Let ¢ be any constant, nonzero vector. The set of vectors 
{ac; 0 < a J 1} isa straight line (ray) connecting the origin with 
c. Integrating along this ray, we have the identity 


n 1 
Side) = ia | (46) 


j=1 i 
Suppose that f(¢) is 0 for some ¢ # 0. Then obviously 


0 = e'Ke) = ef 


t=1 


1 n 
aac) 
= | dz, «| da 


<0 


which is a contradiction. Hence V is positive definite. The pre- 
ceding argument shows also that x’f(ax) > — © with a— o, for 
any fixed vectorx ~ 0. But this can happen only if at least one 
component of f(x) tends to infinity in absolute value as ||x|| tends 
to infinity which completes the proof. 

The most obvious illustration of this result is the following: 

Example 12. Passive Linear Stationary System. In Example 10, 
in the linear case, one can define the state variables in such a way 
that the infinitestimal transition matrix F is negative definite. 
This is due to the fact that the terminal impedance matrix of an 
n-port network containing only linear, passive resistors is nega- 
tive definite. Then stability is immediate, by Theorem 4. In 
this special case, therefore, the Routh-Hurwitz inequalities are 
actually identical with the ones obtained by testing for negative 
definiteness. 

A less trivial example is: 


(x9) 


ox 


2 


Fig. 13 System illustrating Krasovskii‘s theorem 


Example 13. Second-Order System With Two Nonlinearities 
Fig. 13 illustrates the system 


i = + 


(47) 
where f;(0) = f.(0) = 0; f; and fy are differentiable functions. 


Here 
1 Se'(22) 2a 


To satisfy the hypothesis of Theorem 4 we must have 
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— (1 + > forall x 
fi'(m) S — € <0 for all x 


These requirements are considerably more restrictive than the 
formal Routh-Hurwitz conditions which would be 


(48) 


— >O0 foralla ~0, 22 ¥0 


(49) 
+a S —€< 0 forallz, ~0 
or, using the other type of linearization, 
— 2 > for allx 
(50) 


fi'(u) +a Ss —e€<0 forallz 


where (50) evidently implies (49). Using a special Lyapunov 
function, Krasovskii [36] was able to show that the asymptotic 
stability in the large of the origin of the system (47) is guaranteed 
even when conditions (48) are replaced by the first of conditions 
(50) and the second of conditions (49). On the other hand, as 
Krasovskii also shows, conditions (49) are not sufficient even for 
local asymptotic stability. 

It is not a simple matter to obtain Routh-Hurwitz-like con- 
ditions for nonlinear systems since it is not clear what kind of 
linearization is the most natural one. Much more mathematical 
research remains to be done on these problems. 


8—Estimation of Transient Behavior 


If the stability of a given system has been established by means 
of a Lyapunov function, the latter can be used to estimate the 
rapidity of transient response, effect of perturbations or variation 
in parameters, ete. All of these applications stem from regarding 
the Lyapunov function as a measure of distance in the state 
space. 

Consider the obvious inequality 

Vix, t) = (V(x, t)/V(x, t)]V(x, t) 
(51) 
—7nV(x, t) 


where 7 is the minimum of the ratio — Vix, t)/V(x, t) in some re- 
gion of the state space excluding the origin. For instance, in case 
of uniform asymptotic stability in the large, 


Inf 0 < rt 20; (52) 


if r is chosen so that B(||x0l|) < a(r) then the motion will never 
leave the region ||x|] < r so that the inequality (51) is applicable 
for allt = t. By elementary calculation, (51) leads to 


V(O(t; xo, to), t) exp [—m(t — V(xo, to) (53) 


Interpreting V as the distance from the origin, it is clear that (53) 
gives an estimate of how fast equilibrium is approached. 

In fact, 7~! can be interpreted as the largest time-constant over 
a certain region in state space and may be regarded as a figure of 
merit of the system. A number of points should be observed in 
this connection: 

(a) If V is only semidefinite, we may be able to conclude 
asymptotic stability by means of Corollary 1.3. But then 7 = 0 
and we cannot use (51) to estimate the rate of approach to 
equilibrium—in fact, then (51) merely implies stability. Fortu- 
nately, by Theorem 2, one can always find, in principle, a 
Lyapunov function with negative definite V in case of uniform 
asymptotic stability. Another situation where semidefinite V 
leads to difficulty occurs in the next section. 

(b) It may be useful to consider the values of 9 over several 
different regions of state space. For instance, in Example 2, 7 = 
2aV by equation (4). Since V + 0 with x — 0, it follows that, if 
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7 is defined by (52), it is zero. On the other hand, in any annular 
region r; ||x|| re, » = 2ar,?. 

(c) The specific value of 7 depends on the Lyapunov function 
chosen. Therefore one may want to choose V so as to maximize 
n. Little is known at present as to how this can be done. 

Example 14. Calculation of 7 in Linear Stationary Case. Con- 
sider a free, linear, stationary, asymptotically stable system. Let 
Q be an arbitrary symmetric, positive-definite matrix and find 
the corresponding symmetric, positive-definite matrix P by Corol- 
lary 3.1. Then 


V(x) = = (54) 
A definition of 7 analogous to (52) is then 
7 = = 1) (55) 
x 
and now the minimization can be performed with the help of the 
usual Lagrange multiplier technique. Let T be the Lagrange mul- 
tiplier; then minimizing x‘(Q — I'P)x with respect to x implies 
that the minimum occurs at a value x* of x such that 
(Q —TP)x* =0 (56) 
Therefore 


x*’/Qx* = I'x*’Px* = [> 0 


which is a minimum if T is. 
the matrix QP-'. Hence 


n = XAmin( (57) 


The inputs or perturbations acting on a control system are fre- 
quently not known accurately, although some estimate of their 
maximum amplitude may be available. Similarly, the function f 
may not be known exactly, or may change due to parameter varia- 
tions. Both effects may be represented by adding a term on the 
right-hand side of (5-F). In such cases, there may be no asymp- 
totic stability but one can ask the question, ‘‘What is the smallest 
spherical neighborhood about the origin which the motions of the 
perturbed system are sure to enter?’ The radius of this neighbor- 
hood depends on the bounds of the perturbation and the parame- 
ter variation and can be easily estimated using the “second 
method.” 

Write (5-F) in the form 


But, by (56), T is an eigenvalue of 


dx/dt = E(x, t) + g(x, t), 
where 
IIs(x, Sa < (58) 


Let V(x, t) be a Lyapunov function for the system (5-F) in ae- 
cordance with Theorem 1, with derivative V(x, t). Then the 
derivative of V for the system (58) is 


V(x, t) + 9’(x, t)[grad V(x, 
< + coll grad V(x, (59) 


If V and y are given in manageable form, one can then estimate a 
constant ¢;(co) such that if ||x!! > «, then expression (59) is 
negative. 


Example 15. Effect of Perturbations and Parameter Variations in the 
Linear Stationary Case. Consider the system 


dx/dt = Fx + G(x)x + u(t) (60) 


where 


\|G(x)|] < 
< 


Assume that F is asymptotically stable and define a Lyapunov 
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function by means of Corollary 3.1, as in Example 14, equation 
(54). Then 


Vix, t) = + 2[x’G’(x) + u’(t))Px (61) 
By Schwarz’s inequality, 
(x'G'(x)Px| ||Px| | 
and < 
Hence (61) becomes 
V(x, t) + col! + (62) 
Now observe that (proof as in Example 14) 
= < Amax(P)||x!!4p 
Using this, it follows, 
Vix, t) S + + (63) 
where 7 is as defined in Example 14, 
Cx(Co) = 
and 
= (P) 
Therefore if 


= — (64) 


then V is negative. 

The quantity [7 — 2c:(co)]~' can be interpreted as the upper 
bound on the time constant taking into account the parameter 
variations. The upper bound of the radius of the region which 
the motions are sure to enter is proportional to this quantity and 
to the bound on the noise u(t). 


$—Relations With System Optimization 


An important aspect of the “second method” of Lyapunov is its 
close relationship with problems in the dynamic optimization of 
control systems. This interrelation has not been explored so far 
in detail even in the Russian literature and provides much in- 
centive for future research. 

Briefly, the situation is as follows. Let p(x) be a continuous, 
nonnegative function which serves as the error criterion of a 
regulator whose purpose is to maintain the system at all times as 
close as possible to the equilibrium statex, = 0. Thus p(0) = 0. 


For instance, one could take p(x) = z,%, or p(x) = 1 — exp (—z,*), 
n 


or p(x) = pa |z;|, ete. 
i=1 
Now define the performance index of the system as the inte- 
grated error criterion (assuming stationarity): 


V(x) = x, 0))dr (65) 


This integral is a positive number and can serve as a figure of 
merit of the system. (When the integral is infinite, it is useless; 
this can be easily avoided by replacing the upper limit of integra- 
tion by 7, where 7 denotes an interval of time over which the 
behavior of the system is of interest.) 

If the regulator contains adjustable parameters p these can 
be picked in such a way that the value of V(x, p) is minimized 
with respect to those parameters for one or more initial state x. 
This formulation of the optimization problem contains the well- 
known Wiener-Hall theory [37]. 
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Now the mere fact that the integral (65) is minimized by a 
physically realizable system does not imply at all that this system 
is also asymptotically stable. The latter will be the case, however, 
if V(x) is a suitable Lyapunov function. 

To be more precise, consider only linear stationary systems 
(5-LS). Because p is nonnegative and the integrand is continuous 
in 7, the integral in (65) can be zero only if 


p(o(r; x,0))=0 forallr 20 (66) 


Assume now that p(x) is positive definite. Then (66) implies, 
using also linearity and stationarity: 


o(r; x, 0) = (exp Fr)x = 0,7 20 


Since the transition matrix is nonsingular, this implies x = 0, 
which shows that V is positive definite. 

Assuming further that V is finite in some neighborhood lx] Sr 
of the origin, a simple calculation (as in (24)) shows that in the 
same neighborhood 


Vix) = —p(x) <0 whenx 


ie., V is negative definite. Therefore under the assumptions V 
satisfies requirements (i-A), (ii-A) of Corollary 1.2 so that (5-LS) 
is asymptotically stable; by linearity, it is then also asymptoti- 
cally stable in the large so that the restriction about a finite 
neighborhood of the origin made above is immaterial. Thus we 
have obtained the following simple but useful result [16]: 
Theorem 5. (Kalman) Consider a free, linear, stationary dynamic 
system with an equilibrium state at the origin and ASSUME 


(i) The error criterion p(x) is positive definite and p(Q) = 0; 

(ii) The performance index V(x) defined by (65) is finite in some 
neighborhood of the origin. 

THEN «x, = 0 is asymptotically stable. 

The assumptions on p(x) could be considerably weakened, of 
course; the extension of the theorem to the linear nonstationary 
case is completely straightforward. 

The importance of this theory is emphasized by the following 
facts: 


(a) In the conventional theory of optimization the assump- 
tion of positive definiteness of p is missing. For this reason, the 
theory in its classical form is unsatisfactory since there is no proof 
in general that the optimized system is stable. In the simple 
cases usually discussed in textbooks, this omission is actually not 
serious due to the accidental fact (cf. here Examples 7 and 11) 
that often even positive semidefinite error criteria imply that 
V(x) is a Lyapunov function since the error criterion does not 
vanish identically along any trajectory (Corollary 1.3). But this 
is much less likely to happen in systems with several inputs and 
outputs—one of the reasons why relatively little progress has 
been made in the theory of multiloop systems. These matters 
are discussed in detail in a recent paper of Kalman (16, 41). 

(b) If V(x) can be made finite by some choice of the parameters 
of control system, then it follows that the optimal system will 
also be stable since V(x) < V(x). But if the open-loop system 
is unstable, it is by no means obvious that V.,:(x) can be made 
finite for all x and thereby stabilize the system. In fact, in some 
nonlinear systems [38-40], one can at best get local stability in 
this case. However, Kalman has shown [16, 41] that, if the 
definition of the control problem is physically meaningful at all, 
then in the linear case an originally unstable system can always 
be stabilized by a suitably designed controller. 

(c) Note that Theorem 5 is not based on any special assump- 
tions concerning the error criterion p, for instance, that it should 
be a quadratic function of the state. From the point of view of 
getting general results, the particular form of p is unimportant; 
unfortunately, however, the problem of minimizing V(x, p) is 
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usually difficult to carry out and leads to a nonlinear feedback 
system whenever p is not quadratic. 

(d) By minimizing V(x, p)/p(x) with respect to p, one auto- 
matically obtains the best possible upper bound on the time 
constant of the system as discussed in Section 8. 

(e) When p(x) = 2,?, the classical method of evaluation of (65) 
in the linear stationary case is based on the identity 


0 


where X,(iw) is the Fourier transform of z(t). The right-hand 
side of the integral is then evaluated by means of the calculus of 
residues; the resulting formulas have been tabulated [37]. 
Some may regard this as a clever application of the calculus of 
residues. However, the same task can be solved in a much more 
general and elementary way by applying Corollary 3.1, i.e., by 
formulating the problem in the “right’’ way from the stated point 
of view and then solving the linear algebraic equations (19). Thus 
Corollary 3.1 can be used not only as a method of checking sta- 
bility but also for system optimization. The following elementary 
example, suggested by Herschel [42], illustrates this: 

Example 16. Conventional Optimization of Second-Order Stationary 
Systems. Consider the second-order linear stationary system in 
the output 2: 


4 + +2 = 0 


or = 2 
(67) 


= —2, — 


where ¢ is an arbitrary constant. 
The problem is to minimize the performance index 


1 0 


Using Corollary 3.1, the solution of equation (19) is 


where 


or 


V(x, = + (21? + 22") + 122 (68) 


Suppose now that the constant {(4) is chosen in such a way 
that V is a minimum when z, = 0. This leads to 


= (W1 + 


as the optimal choice of ¢. 

Notice that ¢*(0) = 0.5, a well-known result. The objection 
has been frequently raised that this type of optimization, (i.e., 
using p(x) = 2%) yields a system which is too oscillatory. The 
usual “rule of thumb” is given to be f =0.7. This value is given 
by ¢*(1) = 1/\/2 = 0.707. Thus the use of a positive-definite 
error criterion answers some of the objections raised against op- 
timization based on the integrated squared error. 
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If ¢ could be varied instantaneously, then we could just as well 
write system (67) in the form 


= Ze 
= + w(t) 


where u(t) = ¢(t)ze. If there are no limits on ((¢) then there are 
also no limits on u,(t) and then Kalman’s theory of optimization 
[16] (see also reference [15]) applies. It follows that V(x) for 
this system can be made arbitrarily small by letting u(t) be some 
linear function of the state: 


= a,x,(t) + 


and therefore we would have also 
O*(t) = ari(t)/z(t) + ae 


But even if ¢ must be a constant always, it follows that the 
foregoing optimization procedure is not satisfactory, because one 
should conceivably still select ¢ as a function of the initial state x. 
For instance, if the initial state is z, = 0 and z, arbitrarily, then 
the optimum value of ¢ for this initial state is ©. 

Example 17. Strict Optimization of Second-Order Stationary Systems. 
A more satisfactory approach to the problem of the preceding 
example is to regard the optimization as a (two-person, zero-sum) 
game between Nature and Man. Nature chooses the initial state 
x; Man chooses a (constant) value of f. Nature attempts to 
maximize and Man to minimize the time constant: 


= V(x, £)/p(x) 


It is intuitively obvious and easily proved [43] that if Nature 
plays first, the payoff (i.e., the value of 7) will be at least as great 
as it would be if Man were to play first. This is expressed by the 
well-known inequality [43]: 


ts = Max{ Min[V(x, 
< Min {Max[V(x, ¢)/p(x)]} = 7* (69) 


Here 7¢ is the minimum that Nature is sure to “get’’ and r* is the 
maximum that Man may have to “pay.” 

The payoff function (68-69) of this continuous game is conver 
in the strategies [; for this class of games a fairly complete theory 
is available (Reference [43], Chap. 12). It can be easily com- 
puted that here actually r* = r*. This means that the game has 
a saddle point, i.e., a solution of the game consists of pure strategies 
x* and ¢* which are the values of x, £ at which the Min-Max (or 
Max-Min) of V(x, £)/p(x) is attained. An elementary but some- 
what lengthy calculation shows that, when wu = 1, 


= V(1+ = 
V3 — V5 = 0.437]|x|! 


Te = = 4 + + 1) = 1.665 


Since the value of the game does not change if x is replaced by 
ax (a # 0), the optimal strategy x* is determined only up to a 
constant multiplier. 

Thus we have a solution of the optimization problem which 
does not depend on any arbitrary assumptions as to the initial 
state; 7* is the minimum of the maximum possible values of the 
time constant. This agrees well with the “rule of thumb” 
quoted above. 

Such problems have not been considered in the literature so far 
and it seems to be an open question as to whether the optimal 
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solution of games of this type always results in pure strategies. 


10—Applications to Design 


The “second method”’ can be used in a variety of ways in con- 
nection with system design. A particular advantage of this way 
of approaching design is that one has then a direct way of assuring 
stability. A full discussion of these matters is given in the books 
of Lur’e [7] and Letov [8] cited before. 

To give an idea of the possibilities present, we discuss briefly 
the problem of linear compensation of control systems subject to 
saturation. 

Consider a plant governed by the equations 


dx/dt = Fx + Du(t) (70) 
where the control variables are subject to the constraints, 
Jugt)| <<a,< @ ym) (71) 


This is essentially the relay or saturating servo problem. The 
problem is to return every initial state to the origin. 

It is well known [38-40] that if F has eigenvalues with positive 
real parts then there are some states which cannot be returned to 
the origin by any contro] subject to the constraints (71). Hence 
it is assumed that the free system in (70) is asymptotically stable; 
i.e., that Re \,(F) < O for alli. The case where F has eigenvalues 
with zero real parts can be treated similarly, albeit much less 
simply. 

Under these assumptions, consider the problem of optimizing 
the transient behavior of (70) -by suitable choice of u(t). A 
rigorous optimization, such as minimization of an integral per- 
formance index (65) or taking any initial state to the origin in 
minimum time, will lead to u(t) being a nonlinear function of the 
instantaneous state. This problem is not yet completely solved. 

The following method, suggested by Bass [44], leads toa very 
simple design procedure of practical significance. 

Choose Q > 0 arbitrarily, and employ Corollary 3.1 to get 
P>0. Then ||x||*» = V(x) is a Lyapunov function for system 
(70), with u(t) identically zero. Now choose u(t) so as to make 
V as negative as possible. From (70) 


V(x(t), t) = + 2u’()D’Px(t) 


The second term in this expression wil] be a minimum whenever 
each control variable u,(t) has its maximum magnitude, with a 
sign opposite to that of the corresponding component of the vector 
D’Px(t). Hence 
u,*(t) = —a, sgn (D’Px(t)), = 1,...m) (72) 
where sgn is as defined in (21). 
This method of design is not a true optimum since 


Min { V(x, u(t))/[— V(x, 
u(t) 


< V(x, u(t))-{Min{1/— V(x, u(t))]}} 


On the other hand, the design has the advantage that it leads 
to a system in which there is only linear feedback preceding the 
nonlinearity. 

The result (72) is deceptively simple and it is important to bear 
in mind two major mathematical difficulties. 

First, if the optimal control defined by (72) is substituted into 
(70) there is no guarantee that solutions of the differential equa- 
tion will exist, since sgn is a discontinuous function. This is a 
well-known phenomenon in the theory of relay servos which was 
first discovered by Fliigge-Lotz [45] and later studied in con- 
siderable detail by André and Seibert [46]. However, the ‘second 
method”’ still applies, after a small modification in the definition of 
V; the conclusions so obtained agree with a rigorous treatment 
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of the relay servo problem. The difficulties just mentioned may 
be avoided by replacing (72) with 


u;* = —a,; sat k(D’Px); = u,*(KD’Px) (i = 1,...,m) (72-A) 


where 
+1, z>1 
sat z = z, 
-1, z<-1 


and letting the positive constant k be as large as desired. This 
provides an arbitrarily good approximation to sgn z. 

Second, the design represented by (72-A) is realizable only if 
all state variables can be measured. If this is not the case, the un- 
measurable state variables must be “synthesized’’ from the 
measurable ones by simulating [12] a part of the dynamic equa- 
tion (70). The necessary simulation equipment (in essence, a 
smal] computer) then becomes part of the controller itself (see 
Example 18 and Fig. 14). However, it is then not at all obvious 
as to whether the over-all system, including the simulator, wil] be 
asymptotically stable. Moreover, there may be difficulties due 
to inaccurate knowledge of F and D. 

To examine these questions more fully, let y denote the meas- 
urable and z the unmeasurable state variables. Then (70) can 
be written in the form: 


t 
where F,, and D; (i, j = 1, 2) denote submatrices of F and D de- 
fined in the obvious way. Further, let z denote the “synthetic” 
state variables, and F.,, 6; the best available estimate of the 
quantities F,,,D,. Use the tilde to denote the error in these esti- 


mates such as z =z — Z,etc. Then the equations of the“ simu- 
lator’’ are: 


diz /dt = Fny + Fut + 


which contains only known quantities. (Assuming there are no 
random disturbances, u(t) is always measurable.) 

The form of the optimal control is still represented by (72-A) 
but now D is replaced by 6 and P by P, the latter being calculated 
via (19) from Q and F. Then (with some obvious notation con- 
ventions) the over-all system equations become: 


(a) dx/dt = Fx + But(kO'P(x + 2)) 
+ Fx + + (73) 
(b) dz/dt = Fat + F..x 


Equations (73) define a dynamic system in the state space con- 
sisting of all ordered pairs of vectors (x. z). The two equations 
are loosely coupled due to errors in the prediction of z and in the 
knowledge of F and D. We assert: 

Theorem 6. (Kalman) The free, stationary dynamic system (73) is 
asymptotically stable in the large, for arbitrary k = 0, if the follow- 
ing conditions hold: 

(i) Re A Pax) < 0 for all i. 

(ii) F, D are sufficiently small. 


While the theorem is just what one would expect intuitively, the 
proof is not trivial because of the nonlinear function u*. (If u* is 
linear, the proof is an immediate consequence of Theorem 3 and 
the methods of Example 15). 

PROOF. (a) Consider first the system (73-a), with z = 0. 
By virtue of (72-A), the following function is clearly positive 
definite for all cs 2 0,k = 0: 


V(x) = x’Px — cox’ PBu*( kB’ Px) 


By simple but somewhat tedious considerations it is verified that 
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V(x) is negative definite, provided that (ii) holds and c is chosen 
suitably small. 


The main steps are as follows. First, note that 


Vix) < —ei|{x||? kD’ Px)! 2 
+ (ccs + 


where 
= Min {x(Q — 2F’P)x; ||x\| = 1} 
c = Min {x’((O + B)’PO)x; = 1} 
2\\F’ PO!) 
a 2) PD) 


Evidently ec, c, can be made positive and c, can be made arbi- 
trarily small by virtue of assumption (ii). 
V will be negative definite, independently of k > 0, if 


> (Cocs + c4)?/4 


which can be satisfied by taking co, cs sufficiently small. 

(b) Next, consider the system (73), with F,, = 0, in other 
words, (73-a) is coupled to (73-b), but not vice versa. Let M be 
the matrix which corresponds to Q = land F» through equation 
(19). By (i), M will be positive definite. 

Now define, as a tentative Lyapunov function for the entire 
system (73), 


W(x, 2) = V(x) + col! (74) 


In view of part (a) and noting that lu*(x + z)|| < |lu*(x)|! + 
\|u*(z))|, the derivative of W can be written as 


W(x, z) < —cel|x||? Px ||? 


+ [es||x|| + z 
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Fig. 14 Design of relay control system 


where 


= | PD’), 


Since evidently 


O < < Cote 
Co = 2 


\|w*(kD’Px < 2cyok!| 


it follows that W is negative definite if the matrix 


0 CsCiok 
0 —Ci 
CeCiok CoCiok 


is negative definite. This will be true if c, is large, of the order of 
k*. But the choice of c; > 0 in (74) is arbitrary. Hence for arbi- 
trarily large, but fixed, k there is a Lyapunov function of type 
(74) such that 


W(x, z) < —en(||x!!? + ||z!/2) 


(ce) Finally, consider the case when F,, #0. Since by part (6), 
(73) without this perturbation is asymptotically stable, with W 
and W being bounded by quadratic forms, hence by the methods 
of Example 15 it follows that the perturbed system (73) is still 
asymptotically stable when F;. is sufficiently small. Q.E.D. 

The magnitude of the permissible errors F and BD can be caleu- 
lated but this is of little interest except in specific cases. 

It must be borne in mind also that asymptotic stability of F by 
no means implies that of F 2. 

Example 18. Design of aRelay Servo. As a numerical illustration 
of the foregoing design procedure, consider the plant specified by 


—0.01 1 0 0 
F = 0 0 1 , D = 0 
0 —2.26 —0.2 2.5 
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It is easy to see that the transfer function of the plant from 
u(t) to x(t) has no zero, a real pole at —0.01 and a pair of com- 
plex conjugate poles at —0.1 + i1.5. Further, assume that 
}u;(t)| 1 for allt. 


Let Q = I and find P by means of equation (19). Numerical 
solution leads to 
50.00 
4.64 
22.10 


4.64 22.10 
9.71 2.28 
2.28 13.88 


P = 


From (72), it follows that the optimal control is given by 
u*(x) = sgn{—22.10z, — 2.2827, — 13.882,] 


Suppose further that only the output z,(¢) can be measured; 
the rest of the state variables are to be “‘synthesized.’”’ This leads 
to the block diagram of Fig. 14. Since F» is obviously asymptoti- 
cally stable in this case, it follows that the over-all system shown 
in the figure is asymptotically stable in the large for small errors 
in predicting the coefficients of F and D. 

A typical transient response of the uncontrolled plant (i.e., 
u;(t) = 0 for t > 0) is shown in Fig. 15, where z,(t) is plotted for 
the initial states x» = [0, 0, 10]’, [0,0,3]’. Note that because of 
the real pole at —0.01, z,(t) returns to 0 very slowly. With op- 
timal contro] and the same initial states z,(t) goes to zero in about 
30 sec even for the largest z»;—an improvement by about a factor 
of 10 over the uncontrolled system. 


11—Conclusions 


The following aspects of the ‘‘second method’’ of Lyapunov are 
perhaps most significant in the theory of control systems: 


(a) The “second method”’ provides an abstract tool for studying 
stability and transient behavior of dynamic systems without 
solving the differential equations of these systems. 

(b) Using a Lyapunov function instead of a norm, one has 
rigorous methods of analysis of nonlinear systems provided that 
they are (as in most control-system applications) asymptotically 
stable in the large. This refutes the oft-repeated misconception in 
the engineering literature that “nonlinear systems cannot be 
analyzed by exact methods.” 

(c) The abstract method becomes a concrete one whenever 
explicit expressions can be found for a Lyapunov function. This 
is always possible in the linear stationary case, by solving a set of 
simultaneous algebraic equations of order n(n + 1)/2. In the 
linear nonstationary or nonlinear cases, no straightforward 
methods are available at present for doing this. Further applied 
mathematical research should be directed toward developing ef- 
ficient digital computer programs for finding Lyapunov functions. 

(d) Given a fixed system, the measure of “goodness” of the 
Lyapunov function is the number Max V/(— V) (over some re- 
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gion in the state space); this number can be interpreted as the 
rigorous upper bound on the time constant of the system. 

(e) Given a system with free parameters, it is natural to 
identify — V with the error criterion to be used for the system. 
Then V may be identified with the performance index of the sys- 
tem and is to be minimized by proper choice of the free parame- 
ters. 

(f) The optimized system is stable only if the error criterion 
does not vanish identically along any trajectory. This added 
condition is necessary to assure the correctness of the usual 
Wiener theory of optimization. 

(g) The “second method” of Lyapunov is more a unifying 
principle than a method. It does not replace ingenuity. On the 
other hand, in many cases Lyapunov functions are already availa- 
ble, though unrecognized, in standard results in control theory. 
These results can then be used in other ways to estimate transient 
response, effect of random perturbations, and so on. 
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APPENDIX 


The object here is to present in sufficient detail all the relevant 
definitions and facts which are needed in the manipulation of 
norms in applications of the ‘‘second method,’’ proofs of theorems, 
and so on. While these results are very well known, there is no 
single reference which covers all the necessary points. 

The notation at the beginning of Section 4 is used throughout. 
In calculations, vectors are regarded as matrices with one 
column. All vectors, matrices, etc. refer to finite-dimensional 
Euclidean spaces. 

By a positive-definite (positive semidefinite] matrix A we mean 
any n X n matrix such that the quadratic form 

n 
i,j=1 
is positive [nonnegative] for all x # 0. If A is positive-definite 
{semidefinite] then —A is negative-definite |semidefinite|. It is 
not required as a rule that A be symmetric (A = A’) but note 
that if B is any antisymmetric matrix B = —B’ then x'Ax = 
x'(A + B)x. 

The eigenvalues \,( A) of the n X n matrix A are the n-complex 
roots of the polynomial det(A — Al). A positive-definite matrix 
P always has an inverse, in fact A,(P) > 0 for all 7. Note also 
that 4,(AB) = A,(BA) and Af = 

It can be shown that a matriz A is positive-definite {semidefinite] 
if and only if any of the following conditions hold: 

(i) There is a nonsingular [singular | matriz B such that B’B = 
A: 


(ii) > 0 = 0] for all i: 

(iii) all principal minors of A are positive [nonnegative]. 

The concept of the norm is similar to that of the absolute value. 
It is needed in order to be able to make statements about con- 
vergence, continuity, etc. in Euclidean space. Standard ref- 
erences are: Halmos [47], chapter 3; Householder [48]. 

Specifically, a norm is a function which assigns to every vector 
(or point) x in a given Euclidean space a real number ||x!| such 
that 


(Ni) 2 0 for all x 

(N2) ||x + yl] ||xl] + |ly|| for all x, y 

(Ns) = for all x and complex constant a 
(N,) |{x|] = 0 implies x = 0 


If axiom (N,) is missing, then ||x|| is a seminorm. Examples: 


(a) Wall = fed 


t=1 


(b) = = (x’x)' 


t=1 


This is the standard definition of the distance of a point in a 
Euclidean space from the origin, also called Euclidean norm. 
(c) ||x\|a = (x’Ax)'/* where A is symmetric and positive 
definite; called here the generalized Euclidean norm. 
(d) ||x|| = Max {|z;]} 
+ 


In explicit calculations, ||x|| always means the Euclidean norm. 

If A is positive semidefinite then ||x|| 4 is a seminorm. 

By a topology on a mathematical space 3 we mean a collection 
of subsets of 3¢ (called open sets) satisfying certain axioms which 
serve to define abstractly the concept of nearness between ele- 
ments of %. For a Euclidean space, a topology is usually defined 
as follows: Call the set of all points ||x — y|| <r an open r-sphere 
about y. The topology then consists of all r-spheres (0 << r< ~) 
about every point x and their unions and finite intersections. A 
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Euclidean space together with a given norm is then called a 
normed space. 

Convergence and continuity in a space is always defined with 
respect to a given topology. Two topologies are equivalent if 
every open set of one topology contains an open set and is con- 
tained in an open set of the other topology. Two norms in a 
given finite dimensional Euclidean space define equivalent 
topologies since it is always possible to find two positive con- 
stants ¢,, such that [48] 


s s for all x" 


For example if ||x||, is defined as in (a) above and |!x||, as in (b) 
above, then it follows from elementary calculations (ref. [11], p. 
17) 

n 


t=1 


(75) 
The method used in Example 14 proves also the following re- 
lations between generalized Euclidean norms: 
Amin“ < S QP-*)||x||p 
(Q, P symmetric and positive-definite). 


Closely related to the concept of the norm of a vector is the 
norm of a matriz. This is defined by 


|| Al] = Min K such that ||Ax|] < 
For the Euclidean norm, this definition is obviously equivalent to: 


||Al|? = Max{x’A’Ax; x’x = 1} 


(76) 


14 Therefore, in questions of convergence, stability, etc., it is imma- 
terial as to what norm is used. 
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The following inequalities are obvious from the definition 


|A’|| = (78) 


An explicit formula for the Euclidean norm follows at once 
from the second definition and formula (76): 


|| Al]? = Amax(A’A) (79) 


Since A,(A*) = AA), for a symmetric matrix A the expression 
for the Euclidean norm is 


JAI] = = = (80) 


In other cases explicit expressions for the norm are usually not 
available. 


One can easily compute some (more or less pessimistic) upper 
bounds corresponding to the various norms defined above 


s load 


j,t=1 


All s (> at) 


j,t=1 


i=l 
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Control System Analysis and Design 
Via the “Second Method’’ of Lyapunov 


The second method of Lyapunov is applied to the study of discrete-time (sampled- 


data) systems. With minor variations, the discussion parallels that of the companion 
paper on continuous-time systems. Theorems are stated in full but motivation, proofs, 
examples, and so on, are given only when they differ materially from their counterparts 


1—Introduction 


W. APPLY HERE the considerations of the companion 
paper,’ ‘Control System Analysis and Design Via the ‘Second 
Method’ of Lyapunov. I Continuous-Time Systems,” to the 
study of dynamic systems in which the time variable changes 
discretely, i.e., those governed by ordinary difference (rather than 
differential) equations. 

In order to avoid needless duplication, all equations, theorems, 
examples marked by an asterisk will refer to corresponding 
items of the companion paper. Other equations, etc., will be 
numbered consecutively. All theorems will be stated in full, 
but examples, proofs, etc., will be given only if they differ in a 
nontrivial way from their continuous-time counterparts. A few 
results peculiar to the discrete-time case are included, however, 
with full discussion. 

The need for a separate presentation is due to the increasing 
importance of sampled-data systems in connection with digital- 
computer control. The development of the study of continuous- 
time and discrete-time systems has tended to drift apart some- 
what as a result of the popularity of the z-transform approach 
to the latter [1].4 We shall endeavor to emphasize here the es- 
sential unity as well as some of the peculiar contrasts of the two 
subjects, from a point of view rather different from the z-trans- 
form. 


2—Outline of Contents 


In Section 4 we present a method of description of discrete- 
time dynamic systems which is analogous to the terminology now 
current in mathematics for the description of continuous-time 
dynamic systems. The important concepts are those of the state 
and state transition—these notions are indispensable not only for 
the application of the “second method”’ but even for a rigorous 
definition of stability. Section 5 recalls the customary definitions 
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in the continuous-time case. 


of stability; these are fully identical with the ones used in the 
continuous-time case. 

Section 6 presents the main theorems on which the “second 
method’’ is based. Again, the analogy with the continuous-time 
case is complete and the reader is referred to the companion paper? 
for motivation and proofs. 

Section 7 deals with applications to stability theory. Example 
6* presents the derivation of the Routh-Hurwitz inequalities via 
the ‘second method.’’ Since these inequalities are not well known 
in the discrete-time case, a full derivation is given. Theorem 4* 
and Example 2 illustrate the concept of a contraction; this simple 
notion leads to useful nonlinear stability criteria which are at 
present almost unknown in the sampled-data systems literature. 

Sections 8, 9, and 10 are analogous with those of the companion 
paper, with minor variations in the details of the derivations. 


3—Guide to Western Literature 


There is very little literature on the stability theory of systems 
governed by difference equations, since such problems are usually 
trivial special cases of the stability theory of continuous-time 
systems. An early paper is that of Ta Li [2]. Recently, Hahn [3] 
has provided a self-contained summary of the application of the 
“second method’”’ of Lyapunov to difference equations; this ma- 
terial is also included in his book ‘'4]. 

An important prerequisite to the application of the “second 
method”’ is that system equations must be given from the “‘state’’ 
point of view. Often these equations cannot be obtained in a 
natural and convenient way from the z-transform [1] type of 
description of discrete-time systems, particularly when the system 
contains dynamic elements of two types—those governed by dif- 
ference or by differential equations. A detailed treatment of the 
derivation of the discrete-time dynamic equations in cases of 
current practical interest has been given recently by the authors 
[5]. This paper provides a complete treatment of the linear case, 
including the analysis of sampling operations of complicated type 
(see also Reference [6]). 


4—Description of Discrete-Time Dynamic Systems 


Small boldface Roman or Greek letters will denote vectors. 
Capital boldface Roman or Greek letters will denote matrices. 
Unless otherwise specified, all scalars and all elements of vectors 
and matrices will be real numbers. The prime denotes the 
transpose of a vector or matrix; thus the inner (scalar) product is 
written as x’y and a quadratic form as x’Ax. The norm is de- 
noted by |x|] ; in specific calculations this means always the 
Euclidean norm (x’x)'/*. The eigenvalues of a matrix F are 
F). The unit matrix is |. 
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In this paper, we shall study systems governed by the vector 
difference equation. 


= W(x(t,), te) 


where the ¢, indicate discrete values of time (k = integer) 


ta << ben, ... < + © whenk 


at which the behavior of the system can be or is observed; t, 
is regarded as an independent variable analogous to ¢ in the con- 
tinuous-time case. Of course, (5*) is equivalent to the set of n 
scalar difference equations 


2 tess) = u(t), oe ty), t) (5*) 


The vector x is the state of the system (5*), its components z, are 
the state variables. The vector u(t,) is the control function (or 
forcing function or input) of the system (5*); its components u,(t) 
are the control variables. The system is specified by the vector- 
valued function h. It is always assumed that h is a 1-to-1 func- 
tion for any fixed u and ¢,, continuous in all of its arguments. 
The integer n is the order of the system. Usually,m <n. To 
assure that (5*) represents a physical system, we must require 
also that if — o, then h (x, u, t,) x. 
If u(t,) = 0 for all t,, we say that (5*) is free (unforced): 


= W(x(t,), (5*-F) 


Equation (5*) defines a discrete-time dynamic system. Unlike 
in the case of continuous-time dynamic systems which are usually 
specified by their infinitesimal state-transitions (i.e., by differen- 
tial equations) there is no difficulty (i.e., no finite escape time) 
here to obtain from the primary description (5*) all possible state 
transitions. 

For any initial state xo, any initial time t&, and any time ¢,, we 
define by induction the function 


O(t,; Xe, 4) = x(t,) 
xo, = xo forall xo, & 
O( x(t,), ) = W(x(t,), &) for all 


and some fized sequence of values u(t), . . 
forcing function 

We call O(t,; xo, &) (where ¢, takes on all possible discrete 
values) the motion (or trajectory) of the system (5*) going 
through state x» at time & Since the function h in (5*) is 
unique and one-to-one, it follows that for any x» and any discrete 
values t,, t,, t, of time 


O(t.; Xo, t,) = ; Xo, ty) 


If h is not one-to-one, (8*) holds if ¢, < t, S ¢,. It is usually not 
necessary to indicate explicitly the dependence of the motion on 
the foreing function. 

A state x, of a free dynamic system (5*-F) is an equilibrium 
state if 


satisfying 
x(t,), ty, 


u(t,),... of the 


(8*) 


h(x,,%) = x, forall ¢, (10*) 


A dynamic system (5*) is stationary if h(x, u, t,) = h(x, w), 
i.e., h is not an explicit function of t,. Then we take for simplicity 
tes: — t, = constant forall k. Ifa system is both free and station- 
ary, it is aulonomous. 

A dynamic system (5*) is linear if h is a linear function of x 
and uw. 

If x, = 0 is an equilibrium state and if (5*) is linear, then we 
can write 

= A(t,)x(t,) + A(t,)u(t,) (5*-L) 
where 
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H(t,) = 
A(t,) = [0h(t,)/du,] 
Let us define 
=) 
= H(t,)... H(t) 


forall 


for all 


(7*-L) 
t, & ¢, 
As in the continuous case, the matrix @®(t,, &) is called the 
transition matriz of (5*-L); but now its arguments t,, & are de- 
fined for discrete values of time only. 
Now we can write a general expression for motions of (5*-L), 
valid for all t, = to 


toSti<ts 


If H(¢t,) is nonsingular for every t,, then its inverse is 
H-“t,) = &) = Ob, 


Then the transition matrix ®(t,, ¢,) is defined also for t, > t, and 
we can write for any discrete f,, t,, t., 


t,) = P(t, t,) (8*-L) 


and also 
®~\(t,, t,) = P(t, t.) 


If H(¢,) is singular, then (8*-L) holds if we require that ¢, < 
St, 

If the linear system (5*-L) is stationary, then H(t,), A(t,) are 
constants and @(t,, &) depends only on the difference t, — b. 

Example 1—Discretization of Continvous-Time Systems. A discrete- 
time dynamic system may arise from a continuous-time dynamic 
system by simply considering the latter at discrete instants of 
time only. Then our present notation coincides exactly with that 
of the companion paper.* 

But not every discrete-time system can be regarded arising in 
a natural way from a continuous-time system. For instance, if 
®(i,, to) is the transition matrix of a linear, stationary, discrete- 
time system, obtained from a linear, stationary, continuous-time 
system, then we would have 


(9°) 


to) = V(t, to) = exp to) F (1) 


But it is well known that the foregoing equation cannot be satis- 
fied with any real, constant matrix F if @(t, — %&) has any real, 
nonpositive eigenvalues. A simple instance of this is the equa- 
tion 


= —2(t,)/2 


Note also that the discrete version of a continuous-time sta- 
tionary system is stationary only if 44, — t, = constant for all ¢,. 

Example 2—Sampling Systems. A sampling (or sampled-data) 
system is one in which some variables are allowed to change only 
at discrete instants ¢, of time. These instants may specify the 
time at which some physical measurement (say, radar pulse) is 
performed, the time at which the memory of a digital computer is 
read out, etc. There are several different types of sampling opera- 
tions of practical significance: 

(a) Periodic (conventional) sampling: t, = kT, where T = 
constant > 0. 

(b) Multiple-order sampling: The pattern of t,’s is repeated 
periodically, i.e., 4. — 4, = constant for all k. 

(c) Multirate sampling: In case of two concurrent sampling 
operations, t, = or where 7), T: are constants and p, are 
integers. 

(d) Random sampling: t, is a random variable. 

In case (a), the discrete dynamic system is stationary; in the 
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remaining cases the system is nonstationary in general. In case 
(b), the system can be made stationary by considering only 
sampling points ¢,, ti+r, liver, ... In case (c), the system can be 
made stationary by taking samples L(7), 72) units of time apart 
where L(T;, T2) is the least common multiple of T;, 72; this fails, 
of course, if L = ice., if is irrational. 

The “second method” of Lyapunov, as usually stated, applies 
only to free dynamic systems with equilibrium state at 0. 
More generally, this means that the method is concerned only 
with deviations about some fixed motion (see the companion 
paper? for further details). 


5—Concepts of Stability 


The definitions of this section, as well as their motivation, are 
identical in every respect with those of the companion paper* 
except that the concept of finite escape time is missing. 


(S;) An equilibrium state x, of a dynamic system is stable 
{[(Se): uniformly stable] if to any number € > 0 there corresponds 
a number 4(€, to) > 0[6(€) > 0] such that if |! xo — x,|| < 6 then 
x0, to) — x,|| S € for all t, > bo. 

(S;) An equilibrium state x, of a dynamic system is equiasymp- 
totically stable {(S;): uniformly asymptotically stable} if 


(a) it is stable [uniformly stable] and 

(b) to any number yu > 0 there corresponds a number 7, to) 
such that xo, lo) — x,|| Spforalt, 2H+T 
whenever — x,|| r(to) > O[r > 0] being some fixed con- 
stant which does not depend on yp or xo. 


(S,) An equilibrium state x, of a dynamic system is equi- 
asymptotically stable in the large [(Ss): uniformly asymptotically 
stable in the large) if 


(a) it is stable funiformly stable]; 

(b) all motions are bounded [uniformly bounded]; 

(c) all motions (t,; xo, t) (xo, & arbitrary) converge [con- 
verge uniformly in |/x0|| < r, fo; r being arbitrarily large] to x, with 
increasing t,. 

The various implications between these definitions, as stated in 
the companion paper,’ hold in the discrete case also. 


6—The Main Theorems 


We state here results corresponding to those of the companion 
paper.* After obvious modifications (replacing continuous ¢ with 
discrete t,, integrals by sums, etc.) the proofs are identical with 
those given in the companion paper and are therefore omitted 
(see Hahn [3] for a self-contained account). 

Theorem 1*. Consider the discrete-time, free dynamic ‘system: 


x(ten) = h(x(t,), ty) 


where h(O, t,) = © for all t,. SUPPOSE there exists a scalar 
function V(x, t,) such that V(O, t,) = 0 for all t,, and 

(i) V(x, t,) is positive definite; i.e., there exists a continuous, 
nondecreasing scalar function a such that a(0) = 0 and, for all t, 
and all x # 0, 


(5*-F) 


0 < < Vix, ¢,); 


(ii) There exists a continuous scalar function yy such that y(0) = 0 
and, for all t, and all x # 0, 


AV(x, t,) = rate of increase of V along motion starting at x, t, 
[Vid(tin; x, ty), torr) — Vix, &) Mien — 
< 08; 


’ When the discrete-time system is given abstractly (i.e., not by 
discretization of a continuous-time system), there is no loss of gen- 
erality in assuming that t,,: — t, = 1 for all k. 
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(iii) There exists a continuous, nondecreasing scalar function B 
such that B(O) = 0 and, for all t, and all x # 0, 


V(x, 


(iv) ax(||x|]) © when ||x|| ©. 

THEN the equilibrium state x, = 0 is uniformly asymptotically 
stable in the large and V(x, t,) is a Lyapunov function of the system 
(5*-F). 

Corollary 1.1*. The following conditions are sufficient for various 
weaker types of stability: 


1 Uniform asymptotic stability: (i-iii). 

2 Equiasymptotic stability in the large: (i-ii), (iv). 

3 Equiasymptotic stability: (i-ii). 

4 Uniform stability: (i), (iii), and (ii): AV(x, t,) S 0 for all 
x, t,. 

5 Stability: (i-ii,). 


Corollary 1.2*. If the system (5*) is autonomous, it suffices tc 
take V as a function of x only, such that V(O) = 0 and 


(i-A) Vix) > O when x + 0; 
(ii-A) AV(x) < 0 when x # 0; 
(iii-A) V(x) is continuous in x; 
(iv-A)  Vi(x)— when || —> ~, 


Corollary 1.3*. In Corollary 1.2*, Condition (ii-A) may be re- 
placed by 


(ij-A) AV(x) S 0 for all x; 
(ii-A) AV((t,; x0, does not vanish identically in t, 2 bo 
for any to and any xo ¥ 0. 


The problem of the converse of Theorem 1* (i.e., does uniform 
asymptotic stability in the large imply the existence of a Lyapu- 
nov function with properties (i-iv)?) apparently has not been in- 
vestigated so far. Theorem 2 of the companion paper’ does not 
carry over trivially in this case because, as we have seen in 
Example 1, the class of all discrete-time dynamic systems is 
larger than the class of all continuous-time dynamic systems. 

Theorem 3*. Consider the discrete-time, linear, dynamic system 


= H(t,)x(t,) + A(t,)u(t,) 
and ASSUME that 


(5*-L) 


(i) (&- tis) So < for allt,’ 


(ii) O < ce S (tes: ||A(t,)x!] < for all = 1, 
all t, 


THEN the following propositions concerning this system are 
equivalent: 


(A) Assuming xo = 0, any uniformly bounded excitation 
< © when t, = bo 
gives rise to a uniformly bounded response for all t, = to, i.e. 
toS<te 
S ea) < ©; 
(B) For all t, > tb, 


St 
(C) The equilibrium state x, = 0 of the free system is uniformly 
asymptotically stable; 
(D) There exist positive constants cz, cs such that, whenever 
2 bo, 


In this theorem, ¢1, 


. . . denote various positive constants. 
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}@(t,, to)]] (15*) 


(E) Given any positive definite matrix Q(t,) satisfying, for all 
t, 2 bo, 


0 < cyl S < 


the scalar function defined by 
V(x, &) = (tin — t,)|| 


= Pca 


exisis® and is a Lyapunov function for the free system, satisfying 
the requirements of Theorem 1* with 


(17*) 


(18*) 


AV(x, t,) = 


Corollary 3.1*. A discrete-time, free, linear, stationary, dynamic 
system 


= Hx(t,) (tens — ty = 1) (5*-FLS) 
is asymptotically stable if, and only if, given any symmetric, posi- 
tive-definite matrix Q there exists a symmetric, positive-definite 
matriz P which is the unique solution of the linear equation 


H’PH = -Q (19*) 


Vix) = 
is a Lyapunov function for (5*-F LS) with 
AV(x) = 


Corollary 3.2.* The eigenvalues of a constant matriz H are less 
than p in absolute value if, and only if, given any symmetric, posi- 
tive definite matriz Q the linear equation 

pH’PH — P = -Q (20°) 
has a unique, symmetric, positive-definite solution P. 

The proof of Theorem 3 and its corollaries is entirely analogous 

to the proof in the continuous-time case. 


7—Applications to Stability Theory 


7.1 Routh-Hurwitz Conditions for Linear Stationary Systems. As in 
the companion paper,’ Corollary 3.1* gives a purely algebraic 
procedure for checking the asymptotic stability of a free, linear, 
stationary system. (As is well known, a necessary and sufficient 
condition for this is that all eigenvalues of the matrix H in (5* 
FLS) be less than unity in absolute value.) Take any symmetric, 
positive-definite Q (say, the unit matrix); invert (19*) to find P, 
which is always possible in case of asymptotic stability; test P 
for positive-definiteness, in other words, require the leading 
principal minors of P to be positive—these n inequalities are 
analogous to the Routh-Hurwitz conditions. 

This procedure is illustrated in Example 6*. The procedure is 
most useful for machine computation when the number of non- 
zero elements of H is larger than about 4 or 5; in such cases, a 
stability test may require very extensive numerical computations 
by conventional methods. The analog of the canonic matrix 
(33) (Example 7) of the companion paper is so far not yet availa- 
ble for discrete systems. 

Example 6*. Routh-Hurwitz Conditions for General Second-Order 


71f A, B are two matrices, we write A < B[A < B] to express the 
fact that B — A is positive definite [semidefinite]. 

We write x'Ax = ||x||2a. 

* L.e., the infinite sum (17*) converges. 
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Case. Consider the case when H is a 2 X 2 matrix. Direct sub- 
stitution of the general expression for H into (19*) leads to very 
complicated algebra so that some simplification must be sought. 

If H = MM, then A? = det H < 1 insures stability. IfH # Al, 
then it is easy to show that there is some nonsingular matrix T 


such that 
THT = [ ] 


Writing out (19*) in detail for H = C, with Q = lI, we get 
0 
0 a, 
1 —2az 


-1 
0 
a,? — 1 


This set of equations has a unique solution if the determinant of 
the matrix on the left does not vanish: 


det € = (a, — 1){(1 + a)? — a,*] #0 
Assuming (26*), we find after elementary computations 


(26*) 


det 2a,%1 + a;) 


— 2a\a2 


— 


+ 


P = (det 


Now P is positive definite if, and only if, 


2(1 + a,?) 
(1 — + ay)? — ag?) 


det P = >0 (28*) 


2(1 + a) 
(1 — + — 


Pra >0 (29*) 


But (28*) implies that 
(1 + a,)? — a, >0 
and this fact, with (29*) implies that 
Now observe that a; = det C and a, = —trC. Since the deter- 


minant and the trace are invariant under similarity transforma- 
tions, it follows that we can write equally well 


|tr < [1 + det (30*) 

H] <1 (31°) 

which is the general form of the Routh-Hurwitz inequalities in the 

second-order case. Finally, it follows from (30*-31*) that condi- 
tion (26*) is always satisfied. 

7.2 Stability in Nonlinear Systems. The analog of Krasovskii's 
theorem (Theorem 4) of the companion paper? is well known in 
mathematics [7] but not in engineering. It is based on the 
following concept: A function f(x) is said to be a contraction if 

< = 0) 
for some set of values of x ~ 0 and some norm. With this in 
mind, we can state 


Theorem 4*. Consider the discrete-time, free, stationary, dynamic 
system 


= h(x(t,)) 


ASSUME that h is a contraction for all x and some norm. 
THEN the system (5*-FS) is asymptotically stable in the large, 
and one of its Lyapunov functions is 


V(x) = 


(h(O) = 0) (5*-FS) 


(2) 
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footnote 4), we find that 


PROOF: Computing AV and assuming ti: — t, = 1 (see 


the Linear Stationary Case. Consider the system (ts: — 4 = 1) 


= Hix(t,) + K(x(t,))x(t,) + u(t,) (60*) 
V = |{h(x)|| — (3) 
t Sec 
which is negative definite by definition of a contraction. Q.E.D. where [WKcx(4,))]| $e forall t, 
Example 2. Conditions for Contraction. Consider the system lu(t,)!! 
(ters) = (4) Assuming H is asymptotically stable, take any symmetric, 
Assume that there are positive constants ¢, ..., c, such that Positive-definite Q and define a Lyapunov function 
either V(x) 2p 
n 
(a) Max {x rs rvcel <1 forall x by means of Corollary 3.1*. Then, evidently, 
¢ AV(x) = + 2[x’(t,)K’(x(t,)) + |PHx(t,) 
or 


(b) Max <1 forall x 
i 


t=1 


Then in either case H(x)x is a contraction for all x and therefore 


the system (4) is asymptotically stable in the large. 
In case (a), define the norm by 


= Max 


Then 


j=1 
= Max {> 
Yat 
i j=1 


which verifies that H(x)x is a contraction. Note that in this 


norm, the contours V = constant = ¢) are cubes with edges 
of length 


In case (b), define the norm by 


n 


lel] = 


i=l 


Then 


Get 


1 j=1 


which yields the desired result. 
It is clear that whether or not a given function is a contraction 
depends on the norm chosen. This is where ingenuity is needed. 


8—Estimation of Transient Behavior 


By regarding V as a measure of “‘distance’’ in state space, the 
estimation of transient behavior proceeds in a manner which is 
completely analogous to that given in the companion paper.* 
Only the details of some of the calculations are different. This is 
illustrated by the analog of Example 15: 

Example 15*. Effect of Perturbations and Parameter Variations in 
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+ [x’(t, K’(x(t,)) + u’(t,) (K(x(t,))x(t,) + u(t,)] (61*) 


Employing the Schwarz inequality, (61*) may be written in the 
form: 


AV(x) s| + + Vix) 


+ 
2a 


] + (5) 


Using repeatedly the relation (A, B = symmetric, positive- 
definite ) 


Max = Amax(AB™') 
each one of the terms in (5) may be maximized separately, which 


leads to the following: 
AV(x) S + + 
+ 2c; [es + + (6) 
where 
T = Amin T = — & 


7 being an estimate of the time constant of the system similar to 
that discussed in the companion paper;*? and 


C2 = 

= 

= 
From (6) we see that AV(x) will be negative if 


> [eocstes + + + re] 
ent — — 


provided that the denominator of the foregoing expression is 
positive, which is just the condition for asymptotic stability when 
Q = 0. 


$—Relations With System Optimization 


Similarly to the continuous-time case, define a performance in- 
dex in the stationary case by 


Vix) = x, 0)) (65*) 
t=0 
where p is the error criterion. 
For exactly the same reasons as before, we are led to 
Theorem 5*. (Kalman [8]) Consider a free, linear, stationary 


dynamic system with an equilibrium state at the origin and AS- 
SUME: 


(i) The error criterion p(x) is positive definite and p(O) = 0; 
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(ii) The performance index V(x) defined by (65) is finite in some 
neighborhood of the origin. 


THEN «x, = 0 is asymptotically stable. 

The entire discussion of this theorem in the companion 
paper® is applicable here too. We add only one remark: 

In case p(x) = z;? (x, being the output of the system), the per- 
formance index (65*) is obviously minimized for arbitrary x(0) 
if z,(t,) =O for allt, > 0. As Bergen and Ragazzini have shown 
{9}, this can indeed be accomplished by suitable digital compensa- 
tion of single-input systems. As they also pointed out, however, 
their design procedure (which is unique) does not necessarily 
result in a stable system. 

The situation is as follows: If H(z) is the open-loop pulse 
transfer function of the plant to be controlled including the zero- 
order hold, then the transfer function D(z) of the Bergen-Rag- 
azzini controller must contain poles to cancel the zeros of H(z) 
because of the condition D(z)H(z) = z — 1. This cancellation is 
never exact, and if H(z) contains zeros outside the region |z| < 1, 
then elementary root-locus arguments show that the resulting 
system will have a pole outside the unit circle and hence be un- 
stable, no matter how nearly the zero is canceled. This can hap- 
pen quite readily even if the plant is open-loop stable; in such 
cases Bergen and Ragazzini recommend a modified procedure. 

It is sometimes stated that the Bergen-Ragazzini design as just 
sketched fails because of the impossibility of perfect cancellation. 
This is misleading and erroneous, for even with perfect cancellation 
the Bergen-Ragazzini design is not asymptotically stable when H(z) 
contains zeros outside the unit circle. 

In fact, suppose the contrary; i.e., under perfect cancellation 
the system is asymptotically stable. An imperfect cancellation is 
readily seen to be equivalent to a perturbation of the equations 
of the system, as in (60*), But, by Example 15%, if co is suf- 
ficiently small (i.e., good but not necessarily perfect cancellation) 
then the system remains asymptotically stable. In other words, 
asymptotic stability is a system property which cannot be destroyed by 
small perturbations in the coefficients. But this contradicts the 
foregoing conclusions about imperfect cancellation. 

The fallacy in the conventional reasoning is that one cannot, in 
general, conclude that a stable response to a step input implies 
stability of motions starting at arbitrary initial states (see also 
condition (ii) of Theorem 3*). 

We have here a particularly simple example of the fact that 
minimization of performance indexes based on semidefinite error 
criteria is not necessarily free of stability difficulties. 


10—Application to Design 


In the corresponding section of the companion paper,’ we con- 
sidered the problem of designing a regulating system for the plant 
governed by the equations 


dx/dt = Fx + Du(t) (7) 
where the control variables are subject to the constraints 
|u,(t)| (¢ =1,...,m) (8) 


If the controller is to be a sampled-data system, it is convenient 
to transform (7) first into the discrete-time equivalent form 


= Hx(t,) + Au(t,) (70*) 


where it was assumed that 4.4, — 4, = T = constant for all k, 
and 


H =exp7F; A -f, (exp TF)D dr 


Assume that F (and therefore H) is asymptotically stable and 
given any symmetric, positive-definite Q find the corresponding 
symmetric, positive-definite matrix P by means of Corollary 
3.1*. Then 
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Vix) = ||x\|* 


is clearly a Lyapunov function for the free part of system (70*). 
The difference of V along a motion of (70*) is then 


AV(x(t,), = + 2u’(t,)A’PHx(¢,) 
+ (9) 


An approximately optimal and practically useful design is ob- 
tained by selecting u(t,) at each sampling instant ¢, in such a way 
as to minimize AV. 

It is clear that the right-hand side of (9) is a quadratic function 
in the components of u. Disregarding at first the limits (8) on 
the components of u, OAV /du = 0 if u is given by 


u°(t,) = —(A’PA)—'A PHx(t,) 


The matrix A’PA is obviously positive semidefinite since P is 
positive definite. If A is 1-to-1 (ie., Au = 0 implies u = 0 or, 
equivalently, the columns of A are a linearly independent set), 
then A’PA is positive definite so that the inverse required in 
formula (10) actually exists. Moreover, positive definiteness of 
A’PA implies also that (9) has a true minimum at the point 
where the derivative with respect to u vanishes. 

If we now take into account (8), it follows that u should be set 
as close to u° as allowed by the constraints. Hence 


u,*(t,) = —c, sat PHx(t,)); (¢ =1,.. 


u*(t) = u*(t,), 


(10) 


4m) 

(72*) 
 St< ten 
where 

+1, 
satz = 2, 
-1, 


z>.1 
$1 
z<-l 


The difficulties encountered in the companion paper* in proving 
asymptotic stability in the large when F and D are not known 
exactly and when some state variables are not measurable do not 
arise here because the function sat changes by small amounts 
for small perturbations in the argument. 

Theorem 6*. Let x be expressed uniquely as y + 2 where y de- 
notes the measurable and z the unmeasurable state variables; % is 
the predicted value of z. Let A, A denote the available estimates of 
H, A and? = P(A, Q). IF 


(i) < 1 for all i and 
(ii) H — A, A — A are sufficiently small 
THEN the over-all system 


= Hx(t,) + Au*(A, A, B, y(t.) + 20¢,)) 
(ten) = Any(t,) + Ant(t,) + Aw*(A, A, y(t.) + 2(¢,)) 


is asymptotically stable in the large. (In the foregoing, u* denotes 
the function defined by (72*); Hx, Hee are the lower two sub- 
matrices obtained by partitioning H into four parts, in accordance 
with decomposing x as the direct sum of y and z.) 

PROOF. Assume first that H, A are known exactly. Using 
the notation Z = z — 2 for the error in predicting the values of the 
unmeasurable state variables, the over-all system equations be- 
come: 


= Hx(t,) + Au*(x(t,) + Z(t) 
(11) 
= 


Let N be a symmetric, positive-definite matrix. Using assump- 
tion (i), find the corresponding symmetric, positive-definite 
matrix M via Corollary 3.1*. Define the Lyapunov function 


V(x, Z) = + || (12) 
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Using the abbreviation W = H’PA(A’PA)-A’PH, we find 


after simple calculations 
LV(x, Z) S T-*{ —Amin(Q) |x |]? — Amin(N) ||? 

This expression is negative definite if 


—Ymia(Q) Amin(N) — || + || < 0; 


the foregoing condition is satisfied if Amin(N) is sufficiently large, 
which can be brought about by making N large. 

It is now clear that (11) is asymptotically stable in the large, 
with a Lyapunov function of the form (12) and 


AV(x, Z) —k(|| «||? + 


where 0 < k < T'Xmia(Q). 

Inexact knowledge of H and A adds perturbation terms to 
(11), analogous to the terms K(x(t,))x(t,) considered in Example 
15*. By assumption (ii) and the results of Example 15*, the 
conclusion of Theorem 6* follows. Q. E. D. 
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Relations Between the Notch Tensile 
Strength of Cylindrical and Prismatic 
Specimens of Titanium Alloys and 
Heat-Treated Steels’ 


The basic relations which govern the behavior of titanium alloys in the presence of 
stress concentrations, as caused by notching, were investigated for cylindrical and 
prismatic tension test specimens. 
temperature, for 4340 steel heat-treated to various strength levels, and for the titanium 
alloys 6Al-4V and 5Al-1.5 Fe-1 4Cr-1.2Mo (Ti-155A). 


Notch tensile strengths were determined, at room 


Some effects of notch geometry, 


specimen shape, type of loading, material history, and test temperature were included in 


the investigation. 


The information acquired from these studies leads to recommendations pertaining to 
the design of notch tensile specimens for evaluating the relative notch sensitivity of 


titanium sheet alloys. 


General Introduction 


TRUCTURAL AIRCRAFT PARTS made from high bred al- 
loys are known to behave occasionally in a very brittle manner. 
Many factors have been recognized as contributing to such pre- 
mature failures and these relations have been extensively studied 
for heat-treated steel forgings at room temperature [1]? and for 
various heat resisting alloys at elevated temperature [2]. In both 
instances, sharply notched cylindrical tension specimens, de- 
veloped for this purpose [3, 4] are extensively used as a means of 
evaluating embrittling factors. 

In contrast, no such basic information and very little practical 
test data are available on the tendency toward embrittlement of 
shapes, other than cylindrical, and particularly on that of sheet 
(2, 5 to 9]. The small amount of such information does not pro- 
vide any definite basis for the evaluation of embrittlement. This 
also applies to some data on tubing [10]. 

The primary aim of such tests is to obtain a stress or “strength” 
value which indicates a considerable loss in load carrying capacity 
within a certain area of service variables. This deficiency is re- 
vealed by comparing the test value, usually the ‘‘notch strength” 
of a sharply notched specimen, with a reference value for the same 
material under conditions where this material would behave in a 
ductile manner. 

The objectives of the work described in this paper were to 
establish such basic data for titanium alloy sheet [5 to 9]. Asa 
first step in this direction, it was considered necessary to explore 
two major sets of relations. First, the relations between the be- 
havior of titanium alloys to that of a material where the em- 


1 The work described in this paper was performed under Syracuse 
University Research Institute Contract NOas 57-483-d with the 
Navy Department, Bureau of Aeronautics. 

2? Numbers in braekets designate References at end of paper. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29-May 
3, 1959, of Tae American Society OF MecnanicaL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, January 
26, 1959. Paper No. 59—Met-2. 
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brittlement phenomena are well recognized must be established. 
Such data, at room temperature, are primarily available for cylin- 
drical specimens of heat-treated steels (3, 4, 14 to 18]. 

A few tests on titanium alloys in the form of cylindrical speci- 
mens [19 to 23] indicate that this group of materials behaves in a 
manner very similar to heat-treated steels. 

Therefore the first part of this paper concerns a comparison of 
the effects of a number of factors on the notch strength of both 
steels and titanium alloys. 

Second, the relations between the behavior of flat specimens to 
that of cylindrical specimens must be established. Because of the 
complete lack of this type of data, it appeared desirable to study 
again both titanium alloys and heat-treated steels. 

All materials can be classified as either “‘notch insensitive’ or 
“notch sensitive.” Notch insensitive metals possess a notch 
strength that is generally higher than the ultimate tensile strength 
by an amount that depends upon the notch geometry. In the 
case of round specimens, the notch strength ratio of sharply 
notched cylindrical specimens which are of primary interest is 
generally increased as the “notch depth,” or area removed by 
notching, becomes greater. For example, for 30 per cent notch 
depth, the ratio of notch strength to tensile strength, or 
“notch strength ratio” (of many “notch ductile’ materials), is 
about 1.3, and for 50 per cent notch depth about 1.5. This applies 
both to heat-treated steels [3, 4] and to titanium alloys [20, 21]. 

These relations are unknown for shapes other than cylindrical, 
which are subjected to concentric loading. 

A notch sensitive, or ‘‘notch brittle,” material generally exhibits 
a notch strength ratio that is below that of notch ductile materials, 
if the specimen is properly designed, i.e., if it is not too small and 
the notch is deep and sharp. 

The notch strength ratio of such a specimen, therefore, is a 
measure of the sensitivity of this material to suffer a loss in load 
carrying capacity if the part contains areas of high stress con- 
centration. However, it must be compared to the value for a 
similar, notch ductile alloy condition. Also, notch sensitivity 
usually occurs only in certain ranges of geometrical variables and 
cannot be revealed if the tests are outside of this area, e.g., if the 
notch, of the section variations, are rather mild. 
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Furthermore, when cylindrical specimens are tested concen- 
trically they are considered as mildly notch sensitive if the notch 
strength ratio is greater than unity, but as severely notch sensi- 
tive if the notch strength ratio is less than unity. However, this 
is an arbitrary distinction, and it is very doubtful that it can be 
applied to flat (sheet) specimens. 

This paper presents only experimental evidence. Fundamen- 
tal evaluation of its content will be postponed until further test 
data become available on sheet specimens of various dimensions. 
However, in order to facilitate the use of the data presented to 
date, a graph is presented in Appendix II which shows the elastic 
stress concentrations for the major geometries used in this paper. 


Experimental Procedure 


Materials, The steel and titanium alloys included in the in- 
vestigation are presented in Table 1. 

Specimens. Three basic types of test specimens were used for 
the investigation. Cylindrical, square, and rectangular cross-sec- 
tion specimens are shown in Figs. l(a, b, and c), respectively. 
All of the specimens were machined from original stock with the 
axis of the specimen parallel to the rolling direction of the ma- 
terial. The 4340 steel cylindrical and prismatic specimens were 
rough machined to within 0.020 in. of the final dimensions, heat- 


RECTANGULAR SPECIMENS (c) 


Fig. 1 Cylindrical, square, and rectangular specimens used for the 
investigation 


Table 1 Source, heat-treatment, and tensile strengths of the investigated alloys 


Tensile 
strength, 
Alloy Heat-treatment psi 
SAE 4340 1550 F-1 hr-oil 
500 F-1 hr-air 270,000 
SAE 4340 1550 F-1 hr-oil 
1000 F-4 hr-air 165,000 
Ti-6Al-4V 1725 F-10 min-water 
(DOD alloy) 900 F-8 hr-air 165,000 
Ti-5AL-1.5Fe 1700 F-1 hr-water 
1.4Cr-1.2Mo 900 F-8 hr-air 220,000 


(Ti-155A) 


treated according to Table 1, and then finish ground to size. The 
titanium specimens were heat-treated in the form of blanks and 
then machined to the final dimensions after heat-treatment. 
Cylindrical specimens were turned in a lathe and flat specimens 
were milled to the desired shape. 

Notches in cylindrical specimens were machined with a single 
point, carbide-tipped lathe tool which was carefully ground to the 
desired notch radius. The flat specimens were notched with mill- 
ing cutters and carbide-tipped swing tools of selected radii. In 
addition, a few notches were made by grinding (4340 specimens 
only) to compare the behavior of ground notches to that of 
machined notches of like radii and depth. No significant dif- 
ference in their behavior was observed. The notches in titanium 
specimens were all obtained by machining. 

The same cylindrical smooth and notched specimens, as shown 
in Fig. 1(a), were used for both room temperature (and below) 
and elevated temperature tensile and stress-rupture tests on heat- 
treated Ti-6Al-4V and Ti-155A. 

Test Procedure. The tensile tests on both steel and titanium al- 
loys were performed in a 60,000-pound capacity Baldwin-South- 
wark Universal Testing Machine, at a crosshead speed of 0.005 in. 
per minute. A special guiding fixture, as shown in Fig. 2, was 
used to insure the concentricity of axial loading that is required 
for the notch tensile testing of brittle materials at room tempera- 
ture and below. This fixture was not used for tensile tests at ele- 
vated temperatures, but was replaced by a self-aligning system 
with universal joints incorporated at the upper and lower at- 
tachment points. The specimens were heated in a vertical-muffle 
resistance furnace and the desired test temperature was con- 
trolled to within +5 F by means of Brown Electronik Tempera- 
ture Control Units and Chromel-Alumel thermocouples attached 
to the specimen. 

Conventional equipment and techniques were used for the per- 
formance of stress-rupture tests. These procedures have been 
adequately described elsewhere [24] and need not be further 
elaborated upon here. 


PART | 
Test Results on Cylindrical Specimens 


Introduction. The relations which govern the notch strength of 
cylindrical specimens have been comparatively well established 
for heat-treated steels. The steel used primarily in previous sys- 
tematic investigations was 3140 [4, 11]. Extensive tests have 
been also performed on 4340 [15 to 18]. 

To establish more definitely the effects of several factors, tests 
were performed on 4340, selecting a strength level of 165,000 psi 
as a notch ductile and 270,000 psi as a notch brittle condition. 

The effects of a number’of variables on the notch strength of 
cylindrical specimens are also known for several titanium alloys 
[19 to 23]. These range, regarding their notch sensitivity, from 
notch ductile to severely notch sensitive conditions. Some of 


Original 
shape of 
stock Supplied by 
Hot rolled Rotary Electric Steel Co. 
4-in. diam bar 
Hot rolled Rotary Electric Steel Co. 
4-in. diam bar 
Titanium Metals Corp. 
1/.-in. plate 
Titanium Metals Corp. 
3/,-in, diam bar 


Nore: 1 The titanium alloys were solution treated in an argon atmosphere. 
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2 Water quench time was less than 5 seconds. 
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Fig.2 Photograph of concentric guiding fixture used for the performance of tensile tests at room 
temperature 


these data relate to Ti-GAI-4V, heat-treated to various strength 
levels, which appears only slightly notch sensitive, and to Ti-5Al- 
1.5Fe-1.4Cr-1.2Mo (Ti-155A), which is found to be highly notch 
brittle, if heated to strength levels above 200,000 psi [22]. 

This latter alloy, heat-treated to a strength level of 220,000 psi, 
was selected as representative of a notch brittle titanium alloy 
(at room temperature) to establish such relations as the effects of 
notch depth and notch sharpness. 

Further tests were performed at testing temperatures, ranging 
from — 100 to 800 F, on both Ti-6Al-4V and Ti-5Al-1.5Fe-1.4Cr- 
1.2Mo. 

At relatively low temperatures, both steel and titanium alloys 
are known to become rather notch brittle. However, only ti- 
tanium alloys were studied here, although the notch sensitivity 
of heat-treated steels, at temperatures below room temperature, 
is not clearly established. 

At elevated temperatures, apparently all steels become notch 
sensitive within certain areas of strength, applied stress, tempera- 
ture, and time. As a first approach to this problem for titanium 
alloys, short time tensile tests were made on both of the alloys 
mentioned and stress rupture tests were performed on Ti-6Al-4V. 

Effects of Notch Depth and Notch Radius. The dependence of the 


NOCH STRENGTH RATIO 


NOTCH DEPTH - PERCENT 


Fig. 3 Relation between notch strength ratio, notch depth, and notch 
radius for cylindrical specimens of 4340 steel at two strength levels 
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notch strength ratio upon notch depth and notch radius is il- 
lustrated for the two steel conditions in Fig. 3 and for a number 
of titanium alloys (at room temperature) in Fig. 4. (The tests on 
specimens with very mild notches are neglected in these graphs.) 

The notch ductile conditions of both groups of materials exhibit 
a nearly linear increase in notch strength with increasing notch 
depth. The trendline is also practically independent of the 
notch radius, and it is the same for steels and for titanium alloys. 
The notch strength ratio is close to 1.3 for 30 per cent and close to 
1.5 for 50 per cent notch depth. 

Notch sensitivity appears in Figs. 3 and 4 as a deviation to 
lower values from the trendline for notch ductile conditions. 
This deviation depends upon the notch depth in a rather complex 
manner, and possibly differently for steels and titanium alloys, 
respectively. The deviation also greatly increases as the notch 
radius r decreases. The notch sensitivity was not apparent if 
notch radii above about 0.025 in. were used in this study. Rather 
sharp notches must be used to develop notch sensitivity in these 
materials. 

This effect is further illustrated by Fig. 5, in which the notch 
strength ratio is plotted as a function of notch sharpness a/r. 
Close agreement of steels and titanium alloys, in this respect, is 
again indicated by this graph. 


~ 


TITANIUM ALLOYS 


(T1-155A) 


| 
\GBIL & CARWILE 1955) 


NOTCH DEPTH - PERCENT 


Fig. 4 Relation between notch strength ratio, notch depth, and notch 
radius for cylindrical specimens of several titanium alloys 
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Fig. 5 Relation between notch strength ratio and notch sharpness for 


cylindrical specimens of 4340 steel and titanium alloys at 30 and 50 
per cent notch depth 
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Fig. 6 Relation between notch strength and tensile strength for two 
titanium alloys after various heat-treatments (Lenning, et al., 1956) 


220 


Effect of Strength Level and Heat-Treatment. Figs. 3 to 5 also con- 
firm the fact that notch sensitivity of both groups of materials 
can be disclosed by concentric notch tests on small (0.3 in.) 
cylindrical specimens if the tensile strength exceeds certain mini- 
mum values. These values are about 180,000 psi for steels and 
about 150,000 psi for titanium alloys, see Fig. 6. 

In addition, it is known for steels that notch sensitivity varies 
both with chemistry and with a number of other metallurgical 
factors [1, 12 to 14], the strength level being identical. Similar 
effects can be expected for titanium alloys. Thus it appears 
from Fig. 6 [22] that the notch sensitivity of Ti-5Al-1.5Fe-1.4Cr- 
1.2Mo depends upon both solution and aging temperatures. 

Results of Short Time Tensile Tests on Titanium Alloys at Different 
Temperatures. Steels become generally brittle at low tempera- 
tures. The extent of embrittlement revealed by notch tensile 
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Fig.7 Notch tensile characteristics of Ti-GAI-4V at several test tempera- 
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Fig. 8 Notch tensile characteristics of Ti-155A at several test tempera- 
tures 


SMOOTH DUCTILITY - PERCENT 


tests [15], however, is largely unknown for heat-treated steels. 
This problem has not been investigated here, as this would require 
a considerable amount of additional experimentation. 

Titanium alloys are also known to become brittle in the presence 
of stress concentrations as the testing temperature decreases [21, 


Transactions of the ASME 


Tu | TITANIUM ALLOYS 
30% NOTCH DEPTH 30% NOTCH DEPTH id 
eg }- 260 
we 
0.6 - 160 
120 5 
| 
re 
> ESTIMATED FOR hi 
re .005 
bi: 
REDUCTION OF AREA 
U 
=200 200 bor 60 800 
TEST TEMPERATURE - F - 
= — 0 
} 
= TENSILE STRENOTH 
~ & 200 7 = 
+?) ava AGING IN ERS. 
1. a 
.005 
STRAIN RATE = .005 IN. /MIN. 
ma i 
ts | WATER 1100F 2a=.212 
1700F 
x 


rT T T T 
| (Ti-155A) D= 
De. 
(GEIL AND CARWILE 1955/1956) 


mre .005 


-200 200 600 800 
TEST TEMPERATURE - F 
Fig. 9 Comparison of the notch strength ratios of three different ti- 
tanium alloys for short time tensile tests at various temperatures. (The 
curve for Ti-6Al-4V was obtained by interpolation.) 


x0d00 


8 


@OOTH DUCTILITY - PERCENT 


Fig. 10 Results of stress rupture tests on smooth cylindrical specimens 
of heat-treated Ti-6AI-4V. (All specimens were machined from }-in. 
plate.) 


23). This is illustrated by Fig. 7 for Ti-6Al-4V and by Fig. 8 for 
Ti-5Al-1.5F e-1.4Cr-1.2Mo. 

In Fig. 9 the notch strength ratio of these alloys and of a pre- 
viously studied commercially pure titanium (75,000 psi) are as- 
sembled. 

Commercially pure titanium (75,000 psi) appears, according to 
Fig. 9 [20], to retain nearly all of its high notch ductility down to 
a testing temperature of —320 F. The alloy Ti-6AI-4V, heat- 
treated to 165,000 psi, also becomes only slightly notch sensitive, 
as the testing temperature decreases from about 200 to —100 F. 
In contrast, the notch sensitivity of Ti-5Al-1.5Fe-1.4Cr-1.2Mo 
(220,000 psi) continuously increases as the testing temperature 
progressively decreases below 800 F. However, its pronounced 
notch embrittlement at room temperature only slightly changes 
at lower temperatures. 

The ductility of the two heat-treated alloys is presented in Figs. 
7 and 8. Pronounced notch sensitivity, i.e., a notch strength ratio 
below unity, appears to be associated with a ductility of less than 
about 10 per cent, as measured by reduction of area. 
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- 
Fig. 11 Results of stress rupture tests on notched cylindrical specimens 
of heat-treated Ti-6AI-4V. (All specimens were machined from }-in. 
plate.) 


1.2 
200 


TEST THOERATURE - 
Fig. 12 Results of stress rupture tests on th and notched cylindrical 
specimens of heat-treated Ti-6AI-4V at several test temperatures 


The curves in Figs. 7, 8, and 9, for the two heat-treated alloys, 
also indicate a certain arrest in the rate of increase with tem- 
perature of both notch strength ratio and ductility between 300 
and 800 F. This also appears associated with a slight arrest in 
the decrease of tensile strength and notch strength with increasing 
temperature. 

Results of Stress Rupture Tests on Ti-6Al-4V. Many heat-treated 
steels are known to undergo embrittlement at elevated tempera- 
tures. This phenomenon has been extensively investigated by 
means of stress rupture tests on both smooth and notched speci- 
mens [24 to 30]. These studies have shown that the notch 
strength ratio of high strength steels may be extremely low at 
certain combinations of temperature, applied stress, and rupture 
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time. This becomes more pronounced as the rupture life in- 
creases, simultaneously shifting the maximum effect to lower 
temperatures. 

No such information is available for titanium alloys, to date. 
Therefore, stress rupture tests at temperatures ranging from room 
temperature to 900 F, and rupture lives up to 1000 hours, were 
performed on smooth and notched cylindrical specimens of heat- 
treated Ti-6Al-4V. 

The results of these tests, assembled in Figs. 10 to 12, show 
little notch embrittlement of this alloy within 1000 hours at any 
temperature. Possibly, such an effect exists between 700 and 900 
F, but it is within the limits of error (about +5 percent). How- 
ever, this embrittlement appears more clearly in the ductility 
(reduction of area) data and it is, therefore, significant. 

There is a slight discrepancy between the results of the stress 
rupture tests and those of short time tensile tests, see Fig. 12, 
which cannot be explained. 


PART Il 
Test Results on Rectangular and Square Specimens 


Introduction. The relations which govern the notch sensitivity 
of specimens other than cylindrical are unknown. In order to 
establish these basic laws, extensive studies were made on 
rectangular and square specimens of the two 4340 steel conditions 
and of heat-treated Ti-6Al-4V. 

These section shapes add two variables to those encountered 
in tests on cylindrical specimens, namely, the ratio of the lateral 
specimen dimensions A/B and the magnitude of one of these 
dimensions, defining the ‘‘section size.” 

For practical reasons, it was decided to investigate shapes of 
equal initial cross-sectional area, and also to use the same se- 
quence of notch radii for different shapes. 

The tests have been limited to room temperature, to date. 

In most of the tests, notches were machined on only two of the 
four surfaces. Some preliminary tests indicated that specimens 
with notches on all four faces yielded no advantage over 
those with notches on two faces, see Appendix I. 

The primary objective of these studies was to establish the 
dependence of notch sensitivity upon the section shape. In order 
to measure notch sensitivity, the notch strength ratio of the ma- 
terial in question must be compared with that of a similar, but 
notcli ductile, alloy. In the present investigations, 4340 steel, 
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heat-treated to 165,000 psi, has been selected as such a ductile 
material for establishing the required base data. 

Effects of Notch Depth and Notch Radivs. The dependence of the 
notch strength ratio upon notch depth was investigated for two 
shapes, namely, the square (A/B = 1) and “‘simulated”’ sheet 
(A/B = 1/16). The results of these tests on the ductile and 
brittle 4340 steel conditions are assembled in Fig. 13 and on 
Ti-6Al-4V in Fig. 14. 

It appears that the notch ductile 4340, heat-treated to 165,000 
psi, possesses a notch strength ratio that again increases nearly 
linearly with notch depth. The rate of increase, however, depends 
greatly upon the section shape, as discussed further. 

The deviation of the notch strength ratio of the notch brittle 
4340, heat-treated to 270,000 psi, from these ductile base lines 
increases with decreasing notch radius. The maximum effect, or 
notch sensitivity, is found to be considerably greater for the flat 
shape than for the square section. This is explained by the fact 
that for the sharpest notch which could be obtained, r = 0.002 in., 
the notch sharpness, a/r (and stress concentration), is much 
larger for the flat than for the square. 

Ti-6Al-4V, heat-treated to 165,000 psi, ranges between the two 
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Fig. 15 Relation between notch strength ratio and notch sharpness for 
square and rectangular specimens of 4340 steel at 30 and 50 per cent 
notch depth 


Fig. 13 Relation between notch strength ratio, notch depth, and notch 


radius for square and rectangular specimens of 4340 steel at two strength 
levels 
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Fig. 16 Notch strength ratio as a function of notch sharpness for square 
and rectangular specimens of Ti-GAI-4V and 4340 steel for 30 per cent 
notch depth 


Fig. 18 Relation between notch strength ratio, A/B ratio, and notch 
radius for Ti-6AI-4V 


Fig. 19(b) Comporison of notch strength ratio and A/B ratio for 4340 
steel and Ti-6AI-4V for 0.004 to 0.005-in. notch radius 


Fig. 19(c) Comparison of notch strength ratio and A/B ratio for 4340 steel 
and Ti-6AI-4V for 0.020 to 0.025-in. notch radius 
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Fig. 17 Relation between notch strength ratio, A/B ratio, and notch 
radius for 4340 steel at two strength levels 


Fig. 19(a) Comparison of notch strength ratio and A/B ratio for 4340 
steel and Ti-6AI-4V for 0.001 to 0.002-in. notch radius 


steel conditions. For this alloy also, the apparent notch sensi- 
tivity is slightly greater for the flat than for the square section, 
but this effect is less pronounced than was observed for the brittle 
steel condition. 

However, if the data are replotted, according to Figs. 15 and 16, 
with notch sharpness as abscissas, the two shapes appear to be 
nearly equivalent in regard to notch sensitivity, at a notch sharp- 
ness of about 30. Flat sections are more sensitive to sharper 
notches while square sections appear slightly superior for 
milder notches. On the other hand, a relatively sharp notch is 
applied more readily to a wide strip than to a small square. 
This is fortunate as it renders notch tests on sheet a rather 
simple procedure. 

Effects of Section Shape. In order to further establish the effect 
of geometry on notch sensitivity, five different section shapes 
were investigated; A/B = 1/16, 1/4, 1 (square), 4, and 16. 

The results of these tests on the three sections not yet dis- 
cussed are also illustrated in Fig. 16, with the notch sharpness as 
abscissas. It is clear from this graph that the flat sections which 
are notched on the face, A/B = 4 and 16, fail to disclose notch 
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sensitivity, because of the mechanical inability to provide them 
with sufficiently sharp notches. 

Further comparison of the test data is presented in Figs. 17 to 
20, with the section ratio (A/B) being used as abscissas. 

The notch strength ratio of 4340, heat-treated to 165,000 psi, 
outlines the base line (or upper limit of values) obtainable for this 
type of steel, Fig. 17. The notch radius has little effect on the 
position of this base line, except when it becomes very large 
(greater than r = 0.050 in.). Effects of such mild notches are of 
no significance for evaluating notch sensitivity and will not be 
further discussed. 

Fig. 17 also reveals that the notch strengthening of a very thin, 
wide specimen that is notched at the edge is nearly nil. For A/B 
= 1/16, a sharp 30 per cent notch raises the notch strength of 
notch ductile conditions by less than 5 per cent above the tensile 
strength. The maximum notch strength ratio then increases 
rapidly with increasing A/B, to about 1.1 for A/B = 1/4, and 1.2 
for A/B = 1. The notch strength ratios for notch ductile con- 
ditions at A/B = 4 and 16, and 30 per cent notch depth, have not 
been directly determined here. However, it appears from the 
data presented in Fig. 17 that these values are probably close to 
1.3, or the same as that for cylindrical specimens. 

The high notch sensitivity of 4340, heat-treated to 270,000 psi, 
can be disclosed, according to Fig. 17, only by testing flat speci- 
mens, notched at the edges, and nearly square specimens. Also, 
the notch radius must not exceed 0.010 in. for very thin speci- 
mens, and 0.005 in. for nearly square specimens, with a width be- 
tween '/, and one inch, 

Ti-6Al-4V, heat-treated to 165,000 psi, is found to be only 
slightly notch sensitive, in the shape of flat specimens, Figs. 18 
and 19. The notch radius must not exceed 0.005 in., in order to 
disclose this effect, beyond the limit of test scattering. 

Comparison of all three investigated alloy conditions, Figs. 
19(a, b, c), further illustrates these relations. It also confirms for 
flat specimens the fact that titanium alloys respond to stress con- 
centrations in much the same manner as heat-treated steels. 


Conclusions 


The studies reported herein were conducted primarily to es- 
tablish the basic relations required for evaluating the relative sen- 
sitivity of titanium sheet alloys to the effects of stress concentra- 
tions, as caused by notches. Some effects of notch geometry, 
specimen shape, type of loading, material history, and test tem- 
perature were investigated for both heat-treated steels and 


titanium alloys. These studies have resulted in the following 
conclusions: 


1 Cylindrical and prismatic specimens of notch ductile steels 
and ductile titanium alloys exhibit a nearly linear increase in 
notch strength ratio with increasing notch depth. This rate of in- 
crease, however, depends greatly upon cross section shape. Also, 
these ductile trend lines are practically independent of notch 
radius. 

2 For materials other than ductile, the deviation of notch 
strength ratio from the ductile trend lines to lower values increases 
with decreasing notch radius. The maximum effect (notch sensi- 
tivity) is nearly the same for cylindrical and flat (sheet) specimens 
(notched along two edges), if a sufficiently sharp notch of 30 to 
50 per cent notch depth is employed for the tests. This effect is 
similar for both steels and titanium alloys. 

3 Notch embrittlement is observed for both steels and ti- 
tanium alloys if their tensile strength levels exceed about 180,000 
psi and 150,000 psi, respectively. The degree of embrittlement 
observed depends greatly upon section shape, notch geometry, 
and tensile strength. 

4 At room temperature, the alloy Ti-6Al-4V (165,000 psi) 
reveals only mild notch embrittlement, as compared to the rather 
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severe notch sensitivity exhibited by high strength 4340 steel 
(270,000 psi). At higher temperatures, the titanium alloy is 
practically insensitive to notches within 1000 hours, at any tem- 
perature up to 900 F, according to stress-rupture strength data. 
However, a significant degree of embrittlement, as measured by 
reduction of area, was observed between 700 and 900 F. 

5 The alloy Ti-155A (220,000 psi) is extremely notch sensi- 
tive at room temperature and below. This alloy becomes less 
sensitive with increasing test temperature and exhibits nearly 
ductile behavior at 800 F. 


6 It appears that the relative notch sensitivity of sheet 
materials can best be evaluated by testing flat specimens, notched 
at two edges with notch radii of 0.010 in. or less, and with a 
maximum thickness to width ratio (A/B) of 0.25. A notch of 60- 


deg included angle and 30 to 50 per cent notch depth is recom- 
mended. 
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Fig. 20 Relation between notch strength ratio 


and note 
cylindrical and square specimens (notched two sides and four sides) 


of 4340 steel for a constant notch radius of 0.005 in. 
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APPENDIX! 


The Notch Strength of Specimens Notched on Two Sides 
and on Four Sides 


A few tests were performed on 4340 steel, heat-treated to 
270,000 psi, in which square and rectangular specimens were either 
notched at two sides or on all four surfaces. Various notch 
depths, from 10 to 80 per cent, and a constant notch radius of 
0.005 in. were employed for this investigation. 

The effects of notch depth were studied for square specimens 
with the results as shown in Fig. 20. The notch strength ratio of 
both types of notched square specimens (notched on 2 sides or on 4 
sides) were very nearly equal at any notch depth up to 80 per 
cent. In addition, Fig. 20 reveals little difference in notch strength 
ratio between square and cylindrical specimens. 


NOTCH STRENGTH RATIO 


NOTCH DEPTH - PERCENT 


Fig. 21 Comparison of notch strength ratio as a function of notch depth 
for square and thin rectangular specimens of 4340 cteel, having notches 
on either two faces or on all four faces 
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Fig. 22 Effect of section shape on the notch strength ratio of rectangular 
4340 steel specimens with notches on all four faces and a constant notch 


radius of 0.005 in. 
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The small effect of section shape for specimens notched on all 
four faces is probably explained by the fact that their stress state 
is always very similar to that of cylindrical specimens. It may be 
assumed, therefore, that the notch strength ratio of notch ductile 
materials, in these shapes, is also nearly the same as that of a 
cylindrical specimen. 

A comparison of the behavior of square and rectangular speci- 
mens (notched on 2 sides or on 4 sides) of 4340 steel (270,000 psi) 
is presented in Fig. 21. For thin sections, the notch strength 
ratio of specimens notched at two edges, as compared to those 
notched along two faces, is found to be greatly different. It is 
clear from this graph that the edge notches reveal notch sensitiv- 
ity much better than notches along two faces. As discussed pre- 
viously, thin sections which are notched along two faces fail to 
disclose notch sensitivity because of the mechanical inability to 
provide them with sufficiently sharp notches. 

The notch strength ratios of thin specimens, notched on all four 
sides, are approximately halfway between the values for speci- 
mens which were notched only on two sides. Also, these values 
are very nearly equal to those observed for the square shape, see 
Figs. 21 and 22. 

It is clear that the edge notch is the decisive shape factor, since, 
for this type of notch the notch sharpness (for a constant radius) 
becomes greater as the specimen width increases. 

Further tests on specimens with notches on four sides were dis- 
continued, as the considerably simpler flat specimens, notched at 
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two edges, appear suitable for the evaluation of the notch sensi- 
tivity of sheet. 
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STRESS CONCENTRATION FACTOR - K, 


NOTCH SHARPNESS - a/r 


Relation between notch sharpness (a/r) and stress concentration (K;)(31| 
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Nonmetallic Inclusions and Fracture 
Behavior of Steels 


Some observations showing the adverse effects of certain kinds of nonmetallic inclusions 
on the strength and fracture behavior of alloy steel test specimens are presented. The 


observations pertain to the effects in tests at low and high temperatures and under 


introduction 


HE PRESENCE OF A CERTAIN NUMBER of nonmetallic 
inclusions in structural steels as normally produced is almost in- 
evitable. In most instances, their effect on the mechanical 
properties is quite innocuous. However, it is possible that under 
unusual or severe testing conditions, an adverse effect of inclu- 
sions on test-specimen behavior may be found. 

In this paper, some observations of specific instances where the 
strength and fracture behavior of alloy steels were modified by 
the presence of inclusions are presented. The materials con- 
cerned are from large forgings of heat-treated alloy steels. The 
observations discussed relate to several different kinds of fracture 
phenomena; namely, low temperature, high temperature, and 
cyclic loading. The results presented are not intended to be a 
complete exposition of the effects of inclusions; rather, they are 
presented mainly to illustrate some ways in which inclusions may 
influence fracture behavior within the limitations of the test 
conditions employed. 


Materials and Testing 


The materials tested were taken from several large forgings of 
two general types. The low-temperature fracture and cyclic- 
loading tests were conducted on the Ni-Mo-V type of steels. 
For the specific Ni-Mo-V steels used, the major alloy contents 
were in the range 0.25 to 0.30 per cent C, 0.6 to 0.9 per cent Mn, 
2.5 to 2.7 per cent Ni, 0.4 to 0.5 per cent Mo, and 0.1 per cent V. 
The high-temperature rupture and fracture tests were conducted 
on a Cr-Mo-V type of steel. The nominal amounts of the major 
alloys in this type of steel are 0.3 per cent C, 1 per cent Cr, 1 
per cent Mo, and 0.25 per cent V. The amounts of nonmetallic 


Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29-May 
3, 1959, of Tue American Society or MECHANICAL ENGINEERS. 

Norte: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not of the 
Society. Manuscript received at ASME Headquarters, February 24, 
1959. Paper No. 59—Met-10. 


Fig. 1 Distribution of inclusion segregations in a Ni-Mo-V steel forging. 
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steady and cyclic loading. 


elements present in the steels studied were in the normal range 
of values occurring in these types of steels. The forgings were 
heat-treated according to conventional large forging practice re- 
sulting in a tempered bainitic type of microstructure. 

Since a number of different kinds of test specimens are involved 
in these observations, their description will be given in connection 
with each group of results. At this point, it will only be noted 
that, in the case of notched specimens, the stress values given are 
the nominal values calculated assuming elastic behavior and ig- 
noring the stress-concentration effects of the notches. 


Influence in Low-Temperature Fractures 


A major low-temperature fracture problem in Ni-Mo-V steels 
is the one associated with the ductile-to-brittle transition charac- 
teristic of all ferritic steels. In studying this problem, notched 
specimens are often used. Previous work [1, 2]' has shown that 
when notch acuity, specimen size, and testing conditions are ap- 
propriate, the brittleness may be manifested in low-notched frac- 
ture strength. When the critical region at the root of the notch 
contains an unusual number of nonmetallic inclusions, even 
greater losses in strength can occur. The inclusions in these in- 
stances consist principally of sulfides which are segregated into 
localized regions. The macroscopic and microscopic appearances 
of these regions have been discussed previously [3]. These segre- 
gation regions are relatively small in cross section but have con- 
siderable length. A typical dimension would be 0.1 in. in diame- 
ter by four in. in length. They are randomly distributed in cer- 
tain areas of the forging with the long direction generally parallel 
to the longitudinal direction of the forging. Fig. 1 shows the 
distribution of these regions observed in a Ni-Mo-V forging re- 
jected from othér considerations. 

Examples of low-fracture strengths whenever the notch root 
intersected these regions can be illustrated by some data for the 
notched slow-bend type of specimens. The segregation regions, if 
present, were oriented such that they lie in the plane of the notch 
and intersected the notch root nearly perpendicularly. Thus 
high stresses and strains are imposed transversely across these 
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regions. The specimens were either 1-in. sq with a 0.2-in. deep 
notch finished to 0.003-in. radius or 2 in. square with 0.4-in. deep 
notch finished to a 0.005-in. radius. The results for several Ni- 
Mo-V steels tested at room temperature are shown in Table 1. 


Table 1 Influence of inclusion segregation in notched-bend tests of 
Ni-Mo-V steels 


——Nominal bend strength, psi-—~ 
No segregation Segregation 
at notch at notch 


193 ,000 148,000 
158,000 

173,000 80,000 
117,000 


185,900 91,000 


Steel and specimen 
size, sq in. 
A, 1 
C, 2 


E, 2 


The results indicate that the strength of these severely notched 
specimens may be reduced by as much as one half when an inclu- 
sion segregation region coincides with the notch root. In each 
of the low-strength specimens, the presence of the segregation 
regions could be revealed in the fracture surface by macroetching 
the fracture. A typical appearance is shown in Fig. 2. 

The probable influence of inclusions was also observed in slow- 
bend tests of smaller notched specimens from a Ni-Mo-V steel. 
The specimens tested were standard V-notch Charpy specimens. 
In this case, the inclusions appear mainly to influence the amount 
of deformation to fracture with very little effect on strength. The 
reason for this becomes apparent by noting the shape of the nomi- 
nal stress versus deformation curve for this type of specimen 
shown in Fig. 3. The data shown were obtained from a number 
of specimens by loading in successive increments, unloading, and 
measuring the amount of permanent deflection which is con- 
verted to plastic bend angle from geometrical considerations. 
Several of the specimens were loaded to fracture and, although 
the amount of deformation at fracture may vary considerably, 
the nominal stress at fracture is little affected. Since these tests 
were conducted at a temperature below the fracture transition 
temperature of the steel, the specimens broke suddenly and com- 
pletely through at fracture. Several other specimens were de- 
formed partially to amounts indicated in Fig. 3 and then care- 
fully examined for crack initiation in the region below the notch 
root. In specimens deformed to more than about 1-deg bend 


250 


NOMINAL BENDING STRESS (/OOOPS!) 


angle, small microcracks were found beneath the notch apex and 
they were usually associated with nonmetallic inclusions. A 
typical example is shown in Fig. 4. None of the slow-bend 
Charpy specimens contained large segregation regions of the kind 
described previously but it appears that even small inclusions can 
act as crack initiation points in notched specimens. 

From Fig. 3 it is seen that, if a Charpy specimen contained a 
segregation at the notch and the deformation at fracture were re- 
duced to a small value, the effect on nominal strength would still 


Fig. 2. Inclusion segregation in the fracture surface of a notched bend 
specimen of a Ni-Mo-V steel revealed by etching with 10 per cent am- 
monium persulfate 


% FRACTURE CONDITIONS FOR BROKEN SPECIMENS 
4 TERMINAL CONDITIONS FOR MICRO- SECTIONED 


SPECIMENS 


— RANGE OF @ VALUES FOR ALL SPECIMEN 
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Fig.3 Slow bend deformation and fracture behavior of V-notch Charpy specimens from a Ni-Mo-V steel, 
room-tem 


perature tests 
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Fig.4 A crack iated with inclusi located just beneath the notch 
apex in a partially deformed V-notch Charpy specimen of a Ni-Mo-V 
steel. Magnification, 100 X. 


i 


be small. However, the energy required for fracturing would be 
greatly affected since it is related to the area up to fracture under 
a curve such as in Fig. 3, plus a small component due to the elastic 
deflection of the specimen. Thus, in small notched specimens, the 
fracture energy values should be sensitive to the presence of in- 
clusion segregations. A large number of V-notch Charpy impact 
tests had been made on the same forging as used for these slow- 
bend Charpy tests and the results were statistically examined. 
Of a total of 146 specimens from a number of locations in the 
forging tested at various temperatures below the fracture transi- 
tion temperature, about 8 per cent had abnormally low-energy 
values. It was not established that every instance of a low- 


rection of inclusion; room-temperature test. Magnification, 250 X. 
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Fig. 5 A crack in an inclusion extending in the matrix in a small unnotched bend specimen of a Ni-Mo-V steel. Stressed transverse to long di- 


energy value was associated with segregation regions but it seems 
fairly reasonable that this would be the case. It should also be 
noted that this 8 per cent figure is for one forging only and the 
generality of the value is uncertain at present. 

The microaspects of the sensitivity of these segregation re- 
gions to fracturing was also investigated by making detailed 
microscopic observations on small (0.05 in. thick X #/, in. wide) 
prepolished unnotched bend specimens strained in small incre- 
ments. In segregation-free specimens, strains of approximately 
20 per cent did not result in fracture. Specimens containing a 
segregation region oriented so that the stress was parallel to the 
long direction of the segregation region did not fracture at strains 
of 20 per cent. However, if the segregation region was oriented 
transverse to the stress direction, fractures occurred at about 6 
per cent strain. For either orientation of the segregate region, 
the larger inclusions within the segregate cracked at low strains 
(0.5 to 1 per cent). With the parallel orientation, the cracks in 
the inclusion did not extend into the matrix until large strains 
were imposed; while with transverse orientation, the cracks in the 
inclusions quickly extended into the matrix and led to final frac- 
ture. Fig. 5 shows an example of a crack in an inclusion and 
partially extended into the matrix. This situation probably re- 
sults from the ellipsoidal shape of the individual inclusions creat- 
ing the highest stress and strain concentration at its extremities. 

The influence of inclusions often appears in conventional ten- 
sile tests of specimens from heavy forgings. It is a common ob- 
servation that the ductilities parallel to the longitudinal direction 
are generally higher than in the transverse direction. Some tests 
indicate that inclusions which become elongated in the longitu- 
dinal direction during forging may be the predominate factor in 
causing this difference. Examination of specimens partially 
strained and also after fracture show that small fissures are formed 
prior to fracture in transverse specimens but not in parallel 
specimens. Microexamination of the fissures usually show them 
to be associated with nonmetallic inclusions. Apparently, either 
these fissures or the strain concentrations around inclusions lead 
to fracture at smaller over-all strain in the transverse specimens. 

The association between inclusions and tensile fracture duc- 
tility may have some relationship to the correlation derived for 
the bursting strengths of large notched disks [4]. In this corre- 
lation, it appears that tensile ductility in addition to V-notch 
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Fig.6 Rupture life at 1000 F 
for tensile and disk specimens 
of a Cr-Mo-V steel austeni- 
tized at 1750 F 
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' Fig.7 Rupture life at 1000 F 
for tensile and disk specimens 
of a Cr-Mo-V steel austeni- 
tized at 1850 F 
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Charpy transition behavior can be a factor influencing the 
strength of these notched disks. It may be that tensile ductility 
provides an index to the existence and distribution of inclusion 


segregations and the possibility that they may coincide with the 
notches in the disk. 


Influence in High-Temperature Fractures 


In high-temperature strength and fracture behavior, the major 
problem is the formation and growth of intergranular cracks under 


continued stressing. Much work has been devoted to the specific 
influences of composition, heat-treatment, and microstructure on 
this phenomenon. In this section, some observations of the in- 
fluence of nonmetallic inclusions in the Cr-Mo-V type of steels 
are presented. 

These observations are for specimens from two types of tests, 
the unnotched tensile stress-rupture and the centrifugal spin- 
rupture of bored disks. The tensile specimens were of conven- 
tional design with 0.252-in-diam test section. The disk speci- 
mens were 11 in. in diam, 0.5 in. thick, with a 1.25-in-diam center 
hole. Specimens were obtained from two forgings which differed 
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principally in heat-treating temperatures, in that one was 
austenitized at 1850 F and the other at 1750 F. The stress- 
rupture results for tests at 1000 F are shown in Figs. 6 and 7. 
The stress values for the disk specimens are in terms of an average 
tangential stress which is the centrifugal load divided by the net 
cross-sectional area along a diameter. For the material austeni- 
tized at the lower temperature, the disk-rupture times are almost 
equal to the tensile-rupture times. In the case of the higher 
austenitizing temperature, the disk-rupture data are very limited 
and erratic but it appears that the disk-rupture life is somewhat 
less than the tensile-rupture life. The material austenitized at 
1850 F has a higher tensile stress-rupture strength but lower 
ductility and evidently, it is this deficiency in ductility which 
precludes a realization of the higher strength potential in a disk. 

The disk-rupture tests were periodically interrupted for visual 
inspection for crack formation and considerable differences in 
behavior between materials were noted. In the specimens 
from the material austenitized at the lower temperature, cracks 
formed near the center hole at about one half of the final rupture 
life and continued to grow until final fracture occurred. In the 
specimens from the material austenitized at the higher tempera- 
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Fig. 8 Typical intergranular cracks in intact areas of tensile and spin-rupture specimens of a Cr-Mo-V steel austenitized at 1850 F. 


tion, 250 X for (a) and 1000 X for (b). 


ture, no visually observable cracks formed even on inspection at 
very close to the final failure time. These differences in behavior 
have not been completely rationalized at present but they do in- 
dicate the possibility of a significant relationship between low 
tensile-rupture ductility and a propensity to a sudden type of 
fracturing in disk-rupture tests at elevated temperatures. 

It is a general observation that low tensile-rupture ductility is 
associated with extensive intergranular crack formation. Micro- 
scopic examination of the intact areas of both the tensile and disk 
specimens showed this to be the case, particularly in the speci- 
mens from the steel heat-treated at the higher austenitizing tem- 
perature. Cracks were located throughout the length of the ten- 
sile specimens but were concentrated near the center hole in disk 
specimen. The pertinent feature of these cracks was that, in 
nearly every instance, they were associated with nonmetallic 
inclusions as evidenced by micrographs in Fig. 8. Most of these 
nonmetallics seem to be either sulfides or silicates. Cracks asso- 
ciated with inclusions were also found in the steel austenitized at 
the lower temperature although, in this case, they seemed to be 
associated with regions having a much larger concentration of 
inclusions. 

These results imply that increased elevated-temperature 
strength in nonuniformly stressed components such as rotating 
disks is not achieved by simply increasing the tensile stress-rupture 
strength of the material at the expense of rupture ductility. 
Furthermore, in attempting to increase the rupture ductility of 
steels, it appears that attention should be devoted to minimizing 
the amount of nonmetallic impurities. It should also be men- 
tioned that the lower austenitizing temperature is representative 
of present heat-treating practices for Cr-Mo-V forgings. 


Influence in Cyclic-Loading Fractures 


Several investigators [5, 6] have already pointed out the gen- 
erally detrimental effect of inclusions on the fatigue strength at 


many numbers of cycles. The improved fatigue properties of 
vacuum melted steels are usually attributed to this factor. The 
observations in this section refer specifically to influence of non- 
metallics on the cyclic-loading behavior when the number of 
cycles to failure is relatively small and, as a consequence, the 
stresses are high. Furthermore, the studies deal with steels 
tested below their ductile-to-brittle transition temperature, so 
that fracturing by brittle crack propagation is also possible. 
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In one group of experiments, large notched bars were subjected 
to cyclic-bending loading. The loading was unidirectional so 
that the notched side goes through a zero-to-maximum tension 
cycle. With sharp notch radii and high nominal stresses, crack 
initiation occurs very early and most of the cyclic life is spent in 
fatigue crack propagation. The specimen fails when the local 
conditions at the edge of the deepening fatigue crack becomes 
critical with respect to initiation of a brittle, fast running fracture. 
The number of cycles that elapses before this critical condition 
occurs is determined by the load level imposed, the stress increase 
due to reduction in load-carrying area, the sensitivity of the 
steel to brittle fracturing and the existence of localized ductility 
deficient regions in the steel. 

It has been observed in such tests on Ni-Mo-V steel specimens 
that, when an inclusion segregation region of the kind described 
previously is present in the plane of the notch and intersects the 
notch root, a large decrease in cyclic life occurs. In one set of 
comparative tests, the presence of a segregation region reduced 
the cyclic life by about one half. The shorter cyclic life was also 
accompanied by a smaller depth of fatigue crack propagation be- 
fore brittle fracture occurred on the final cycle. These results for 
eyclic-loading conditions are mainly a consequence of the reduc- 
tion in the single cycle-loading fracture strength due to the in- 
clusion segregation. The decrease in cyclic life does not neces- 
sarily mean that a fatigue crack propagates more rapidly when a 
segregation is present. 

Another example of the influence of inclusion segregation re- 
gions is illustrated by the series of specimens shown in Fig. 9. 
In these specimens, the notch axis is oriented parallel to the long 
direction of the segregation regions. The dark appearing spots 
visible on the side surface of the specimens locate the segregations. 
It can be seen that, with this situation, a fatigue crack which 
starts at the notch bottom may, or may not, encounter a segrega- 
tion in the course of its propagation. For a given stress, longer 
cyclic life can be expected if the encounter distance is large. The 
specimens in Fig. 9 were 1.5 in. sq with a 0.3-in. deep notch 
finished to a 0.005-in. radius. When cycled to 125,000 psi 
maximum nominal bending stress, the cyclic lives ranged from 

783 to 5626 cycles. It is evident from Fig. 9 that increased 
cyclic life is associated with a longer encounter distance in these 
specimens. 

In addition to these situations where the growing fatigue cracks 
may encounter inclusion segregations, the inclusions themselves 
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Fig. 9 Distribution of inclusion segregations in high-stress notched fatigue specimens of a Ni-Mo-V 
steel. Cycles to failure for identical cycling loads: Upper specimen, 3783; middle specimen, 4064, bot- 


tom specimen, 5626. 


can serve as initiation points of fatigue cracks in the case of un- 
notched specimens. This aspect of the fatigue problem has been 
discussed in reference [6] in which it is pointed out that, under 
low-stress high-cycle conditions, a crack will be initiated only at 
the largest inclusion and this crack propagates to final fracture. 
For high-stress low-cycle conditions, cracks start at all sizes of 
inclusions and then join up to cause final fracture. The latter 
situation was observed in unnotched specimens of a Ni-Mo-V steel 
tested in axial tension-compression loading. Macroetching re- 
vealed that each of the specimens contained a number of in- 
clusion segregations running transversely through the specimen 
at various points in the test section. The stress levels studied were 
in the range from slightly less than the elastic limit to well into the 
plastic range. In nearly all of the specimens, a fatigue crack 
initiated at one of the inclusion segregations and progressed to 
fracture indicating that these segregations can have an influence 
on low-cycle high-stress fatigue behavior even in unnotched 
specimens. 

From these several examples, it seems clear that considerable 
increases in low-cycle fatigue behavior can be expected if these 
inclusion segregations can be avoided. As yet, the data are not 
sufficient to indicate how small and well dispersed the inclusions 
need to be to realize the improvement. 


Summary and Conclusions 


In studying the influence of nonmetallic inclusions on the 
strength and fracture properties, definitive results are difficult to 
obtain. However, the examples and observations presented have 
attempted to show some test conditions under which certain 
specific forms of inclusions can adversely affect the strength and 
fracture behavior of alloy steels at low and high temperatures 
and under cyclic loading. 
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iii EXCEPTIONAL SITUATIONS a crack traversing 
a plate in tension takes a path at right angles to the direction of 
greatest tensile stress. When there is a substantial plastic reduc- 
tion of plate thickness leading the crack the observed fracture 
mode is usually a sliding separation at 45 degrees to the plate 
surfaces sometimes termed a ductile or shear fracture. On the 
other hand when the contraction of thickness is relatively small 
the locus of the crack is usually a plane normal to the direction of 
greatest tensile stress and thus at 90 degrees to the plate surfaces. 
For this fracture mode designations such as brittle, flat-tensile, 
separation, and opening mode are sometimes used. Flat-tensile 
and brittle are employed here for this purpose as equivalent terms. 

Often used with reference to fracture appearance, the terms 
ductile and brittle convey, in addition, the inference of a relatively 
large deformation work rate associated with a ductile as compared 
to a brittle mode of fracturing. Three major factors which con- 
tribute to a ductile-to-brittle fracture mode transition have been 
known for many years. These are increase of dimensional size, 
elevation of yield stress by temperature reduction or rapid stress- 
ing, and increase of microstructural weakening effects. Two 
examples of the latter are, (1) segregations and loose bonding at 
grain or solidification boundaries and, (2) a distribution of lattice 
strains or impurity atoms which favors stress relief by cleavage in 
place of stress relief by plastic deformation. 

Two 1958 papers, References [1 and 2],! discussed conditions 
for the ductile-brittle fracture mode transition in terms of speci- 
men thickness, yield stress, and critical fracture work rate in the 
flat-tensile fracture mode. Reference [1] was primarily concerned 
with a certain class of fracture strength measurement techniques. 
Reference [2] was a general review of fracture mechanics. In this 
paper the fracture mode transition itself is the major topic. The 
discussion will support and extend the remarks on fracture transi- 
tion given in the previous papers. As in the current paper by 
B. M. Wundt [3] the crack extension force suggested by the 
Griffith theory is of central] interest. 

It has been pointed out [4, 1] that crack propagation is of prac- 
tical importance primarily when it occurs through a section sub- 
jected to an average tensile load less than would be required for 
general plastic yielding. Under such conditions an analysis of 
stresses neglecting plastic strains has an obvious approximate 
validity. Advantage can be taken of this in several ways. By 
eliminating plastic strain complexities from the stress analysis one 
can define the crack-extension-force in a manner convenient both 
for calculation and measurements. One can then establish in a 
systematic way a procedure for using laboratory fracture test 
measurements to estimate the strength of structural components 
containing various size cracks. 


The Crack Edge Stress Field 


For the purpose of this discussion we need to consider only the 
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Fracture Mode Transition for a 
Crack Traversing a Plate 


portion of elastic stress field which encloses the advancing edge of 
the crack. The stress field to be considered is an approximate 
description of the stresses in an annular region which excludes 
the region of inelastic strains associated with the fracturing proc- 
ess but does not extend outward more than a small fraction of the 
length of the crack. 

Using z, y and r, 6 co-ordinates as shown in Fig. 1 the equations 
for the stresses in this region, as given in Reference [1] are 


cos 6/2 . 36 
[1 — sin sin Ces (2) 
Tr = K cos cos (3) 
oa, = 0 (plane stress) (4) 
o,=v(o,+,) (for plane strain) (5) 


where vy = Poisson’s ratio. 

The stress intensity factor x is related to the Griffith theory 
strain energy release rate G by the following equations 
(6) 


c= 3 (for plane stress) 


(7) 


(for plane strain) 


The parameter o,, may be omitted since it affects the discussion 
only in minor ways and since it is, in any case, small compared to 
«/+/2r when r is small. The equations for the principal exten- 
sional stresses in the z,y-plane then have the following simple 
form 


cos 6/2 6 

(1 + sin 5) (8) 
cos 6/2 

=K ( sin (9) 


Fig. 1 Two-dimensional crack, showing the (x, y) and (r, 0) co-ordinates 
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when a stress in the z-direction is relevant as in a situation of 
plane strain the third principal extensional stress is 


cos 0/2 
o, = 2K Var (10) 


Plane-Strain Conditions Near the Crack Edge 


For purposes of mathematical analysis a brittle or flat-tensile 
fracture is considered to be a flat internal free surface bordered 
at the advancing edge by a zone of plastic strains of relatively 
small size. To satisfy free surface conditions at the crack sur- 
faces, the stress system adjacent to the advancing edge must be 
one of plane-strain. However, for a through-thickness crack 
moving across a plate, the zone of yielding at the advancing edge 
tends to relax the stress in the plate thickness direction. Obvi- 
ously a plane-strain stress sytem cannot be present where the 
crack intersects the plate surface. Thus a brittle fracture travers- 
ing a plate normally has a perceptible border at the plate surfaces 
where the mode of fracturing changes from flat tensile to shear. 

Consider next the conditions under which a flat-tensile mode of 
fracture can be maintained in central regions of the plate thick- 
ness. An accurate representation of the stress relaxing influence 
of plastic strain near the advancing edge of a crack is not at 
present possible. However, for the general features with which 
we are concerned, estimates of an approximate nature can be 
employed. For estimating the influence of stress relaxation from 
a local zone of plastic strains an elastic stress distribution prob- 
lem studied by Sternberg and Sadowsky [5] is helpful. For a 
plate in uni-axial tension Sternberg and Sadowsky were interested 
in how stresses near the borders of a cylindrical hole in the plate 
varied in character with the ratio of hole diameter to plate thick- 
ness. Their calculations indicated the stress system near the 
middle plane of the plate differed very little from one correspond- 
ing to a plane strain situation when the hole diameter was less 
than half of the plate thickness. Thus the existence of a plane 
strain stress situation near the advancing edge of a flat-tensile 
fracture in a central portion of the plate thickness suggests that 
the stress relaxing influence of the plastic strain zone is no more 
effective than that of an open round hole whose diameter is less 
than half the plate thickness. 

From Equation (8) the distance r from the crack edge at which 
the predicted tensile stress equals the tensile yield stress is great- 
est at 6 = 60 deg where it has the value 


(11) 


The diameter of the open hole whose stress relaxing influence 
would be comparable to that of the crack edge plastic strain zone 
will be taken, provisionally, as equal to ry. A proportionality 
factor adjustment, if necessary, can be made later. The plate 
thickness B necessary for a central region of flat-tensile fracture 
is then given by the equation 


(12) 


For application to problems of crack propagation the stress in- 
tensity values « of primary interest are the values x, which pertain 
to onset of rapid crack extension. These values, however, vary 
with plate thickness. As shown in previous papers [1, 2] sub- 
stantial increase of x,2 occurs with reduction of plate thickness 
during the approach toward fracture mode transition. Fig. 2 
shows this variation as observed for 7075-T6 Al alloy. The ordi- 
nate used in Fig. 2 is the value of G, as computed from Equ- 
ation (6). The difference from unity of the factor (1 — v*) of 
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Equation (7) is neglected here regardless of the degree of plane 
strain. 

From Equation (12) the critical value of the reciprocal of the 
plate thickness for fracture mode transition may be estimated as 


(13) 


A portion of the fracture transition locus line predicted by Equa- 
tion (13), for gy = 72,000 psi and = 10.5 X 10° psi, is shown on 
Fig. 2. This locus intersects the experimental S, curve at the 
point where 1/B is 4.2 (in.)~! and §, is 215 lb/in. The fact 
that this position on the curve of Fig. 2 marks the beginning of 
the fracture mode transition from flat-tensile to shear is evident. 
A comparison of the fracture appearances for the thicknesses 
0.25, 0.187, and 0.125 in. is shown in Fig. 3. 

In order to compare the fracture mode transition behavior of 
various materials it is convenient to express plate thickness in 
terms of a nondimensional parameter @ where 


Bo,* 


a 


(14) 


S:. is the value of S, for completely plane-strain fracturing as 
shown in Fig. 2. Values of §, observed for aluminum alloy 
plates and a rotor steel are shown in Fig. 4. Even allowing for 
considerable experimental scatter it is apparent that a trend curve 
through the observed S, values would cross the Equation (13) 
locus line at values of a not far from unity. The fact that the 
fracture mode transition from flat-tensile to shear occurs with @ 
decreasing through the range 1.5 to 0.5 will be given additional 
illustration later. 


Abruptness of the Brittle-Ductile Transition 


That the change from a brittle to a ductile fracture mode 
should be relatively abrupt with decrease of plate thickness can be 
understood as follows: In the preceding discussion the influence of 
the zone of yielding at the advancing edge of a crack was con- 
sidered equivalent to a selected diameter hole in its tendency to 
relax stresses in the plate thickness direction. The equivalent 
hole diameter was assumed to be proportional to the size of the 
crack edge plastic strain zone. The shear stress corresponding in 
direction with the type of plastic strain which would result in 
plate thickness reduction is 


>/1N 


Fig. 2. Variation of S. with reciprocal of plate thickness showing the 
Gic intercept 
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Fig.3 Fracture appearance for 1/4, 3/16, and 1/8-in. plates of 7075-16 aluminum broken in tension 
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Fig. 4 as function of 1/a for three materials showing intersections 
with Equation (13) near the position where a is unity 
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2024 T-4 ALUMINUM 


(15) 


The positions r, @ from the mathematical edge of the crack where 
Ti, has a fixed value are shown by curve A on Fig. 5 for the 
plane-strain stress system of Equations (8) and (10). Estimating 
the value of the same shear stress component by putting 7, equal 
to zero to represent plane stress, the locus of the same fixed 
Tis Value is shifted out to curve B of Fig. 5. A comparable ex- 
pansion in size occurs for the crack edge plastic strain zone when 
a reduction of plate thickness produces a change from the flat- 
tensile to the shear fracture mode. When the plate thickness is 
too small to meet a flat-tensile fracture condition, for example 
Equation (12), the flat-tensile fracture mode even if initiated, say 
by notch sharpness, is unstable. As the crack extends, the plastic 
strain zone becomes larger than the size characterizing a situation 
of plane strain. When this occurs the shear stress component 
tending to cause thickness reduction increases from relaxation of 
o,. Thus the plastic zone size becomes still larger, then 7:, must 
become larger again, and the process continues toward a situation 
of plane stress with complete relaxation of o,. 

From the reasoning given the change of fracture mode with de- 
crease of plate thickness should be abrupt requiring only a small 
change of plate thickness. For 7075-T6 aluminum a factor of two 
thickness reduction from */, to */1 in. is required for the change 
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x 
Fig.5 Curve A shows locus of a fixed value of (7; —o:)/2 at the edge of 
a fiat internal free surface in tension assuming plane strain. Curve B 


shows the expansion of this fixed value curve when a, is assumed to be 
zero, 


from a flat-tensile fracture with small shear lips to a 90 per cent 
shear fracture. This may seem less abrupt than would be theo- 
retically expected. Several factors affect the degree of abrupt- 
ness of the fracture mode transition. The presence of the shear 
fracture side borders for a flat-tensile fracture traversing a plate 
causes the crack to lead in the center of the plate and lag at the 
edges. This assists maintaining of plane-strain stress conditions 
along the leading edge of the flat-tensile crack in ways not con- 
sidered in Equation (12). 

A second factor is present when the materia] has a dynamic 
yield stress elevation. In the usual fracturing experiment the 
crack extension speed increases with increase in crack length. 
When crack extension begins from a sharp natural crack or notch 
which extends through the plate thickness growth of shear lips 
at the side surfaces which might otherwise result in a completely 
shear fracture may be held to a fraction of the plate thickness if 
sufficient increase of yield stress occurs due to speed of crack ex- 
tension. In fact, for steel plates one often observes first a growth, 
then a reduction, of shear lip size as a crack spreads from a sharp 
notch. The influence of this factor upon fracture mode transition 
depends upon the observed length of run of the crack and the 
character of the fracture test observations. 


Effect of Yield Stress and Temperature 


By the same reasoning as was applied to the effect of plate 
thickness, the crack edge plastic zone size is expected to increase 
with decrease of ay and with increase of G,,. These changes of 
oy and G;,, when of sufficient magnitude, can produce a transition 
from brittle to ductile fracture. For a steel plate, decrease of oy 
and increase of G,- occur together with increase of testing tempera- 
ture. Thus starting with conditions which result in a flat-tensile 
fracture, it should be possible to increase (§;,/ay*) enough by 
elevation of testing temperature tocause transition to a shear frac- 
ture. Basically the fracture mode transition occurs because the 
size of the plastic strain zone at the advancing edge of the crack 
increases relative to the plate thickness. In accordance with the 
line of reasoning presented for the effect of plate thickness, an 
abrupt change with temperature of S,,., of oy, or of both is not 
required for an abrupt change to occur in the mode of fracturing. 
The instability toward fracture transition with growth of the 
plastic zone size is inherent in the stress considerations previously 
discussed. Although a temperature dependent change in the 


mode of crystalline separation may exist and may have a sig- 
nificant influence through its effect upon G,,, the general charac- 
teristics of a fracture transition can be understood without specific 
reference to the mode of crystalline separation. 

Measurements published by Winne and Wundt [6] for flat- 
tensile fracture of two turbine rotor steels indicate a factor of two 
increase of S, with a temperature increase from room temperature 
of 200 F. In the same interval oy* decreased by only 14 per cent. 
Examples of a factor of two change in ay" in a 200 F temperature 
interval are rare. One instance of such a change can be found in 
the dynamic yield stress data for a ship steel studied by Tipper, 
Sullivan, and Krafft [7]. Thus for a fracture mode change with 
temperature the change of G;, is usually the factor of dominating 
importance. However, the influence of yield stress upon the a 
value cannot be neglected particularly in estimating the effect of 
impact as compared to static stressing of a test specimen. Fur- 
thermore, the equation for a does not represent explicitly the en- 
tire influence of a change in oy because G;, must be sensitive to 
the plastic flow resistance of the material. 


— and Calculations Related to Fracture Transi- 
on 


Certain measurements on aluminum alloys and steel have al- 
ready been discussed. The table of results given in Reference [1] 
provides additional] illustrative data. In Table 1 of this paper 
some of the previous data for metals are repeated with the values 
of S;, recalculated using the empirical adjustment equation at the 
bottom of the table. Recent measurements of G, for high strength 
sheet steels are included. The tabulated results were extracted 
from experimental data obtained by Herschel Smith, NRL, 
Harold Bernstein, NGF, and Hugh Romine, NPG. Although be- 
lieved to be correct for the samples tested the steel data may not 
represent typical fracture toughness properties of the materials. 

When applied to an experimental curve of G, as a function of 
reciprocal plate thickness, Equation (13) indicates a position on 
this curve which borders the fracture mode transition on the 
brittle side. Primarily Equation (13) expresses the fact that only 
a limited size zone of plastic yielding at the leading edge of a crack 
is compatible with maintaining plane strain conditions in central 
regions of the plate. The value of a pertaining to this point is ap- 
parently not far from unity for materials so far investigated. For 
the test results of Table 1 the fracture mode transition, from 40 
per cent shear to 100 per cent shear, is completed for all the ma- 
terials tested within the a-value range from 1.2 to 0.5. The cor- 
responding a-value range for a specific material appears to be less 
than half of this interval. 

The introduction and use of the parameter a was motivated in 
part by the thought that the relationship of S, to plate thickness 
for a wide range of metals might be approximately representable 
as 


Ge = Gre f(a) (16 


A valid relationship of this form would be quite useful since a 
measurement of S, at one plate thickness could then be used to 
predict values of §, for other thicknesses. Data such as that 
shown on Fig. 4, although not closely representable by Equation 
(16), nevertheless show a sufficient resemblance in major features 
to suggest ways in which the relative plate thickness parameter a, 
can be useful in the analysis of fracture test measurements, 
Consider, for example, recent measurements by Srawley and 
Beachem [8] of fracture appearance transition temperature for 
certain sheet steels having tensile yield strengths above 180,000 
psi and thicknesses of '/sin. or less. The test specimens employed 
were '/--in. wide tensile strips with a central slot. Brittle crack 
extensions at both ends of the central slot were developed prior 
to testing. The fracture mode transition produced in these tests 
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Table 1 


Thickness, 
in. Aluminum alloys 

7075-T6 

7075-T6 

7075-T6 

7075-T6 

7075-T6 

7075-T6 

7075-T6 

7075-T6 


2024-T4 
2024-T4 
2024-T4 
2024-T4 
2024-T3 
2024-T3 


Steels 
SAE 4340 
SAE 6150 
Tricent 


RR ES 


X-200 
Vasco X-3, G’ 
Vasco X-3, G’ 


AMS 6434(0.36 C) 
AT-3,4 4-400° 
AL-5,4 6-400° 
AT-9, 10, 11, 12-500° 
AL-15, 16-500° 
AT-1, 2-400° 
AL-7, 8-400° 
AL-13, 14-500° 
AT-17, 18, 19-725° 
AT-20-725° 
AL-21, 22-725° 
Longitudinal-RT 
Longitudinal-0°F 
Transverse-RT 
Transverse-O°F 


Titanium B-210 VCA 
Longitudinal 
Transverse 
Longitudinal 
Transverse 
Longitudinal 
Transverse 


= 


* Corrected for plastic zone stress relaxation. 


Computed by assuming = Sie { 1 + 


Cy, 10* psi 


225( Nom.) 
225( Nom.) 


G.*, lb/in. 
115 
115 
115 
115 
115 
115 
115 


on 


0. 
0. 
1. 
0. 
0. 
0.7: 
0. 
0. 
0. 
0 
1. 


¢* Not estimated unless crack length was more than three times plate thickness. 
4 AL and longitudinal specimens were broken across the direction of stringer orientation, 
AT and transverse specimens were broken along the direction of stringer orientation. 


by variations of the testing temperature was sharply character- 
ized by the point at which the shear lips spread together giving a 
100 per cent shear fracture. 

Fig. 6 from the Srawley-Beachem data shows the critical plate 
thickness for the 100 per cent transition as a function of tem- 
perature for two steels. Steel B was a 12 Cr, 0.25 C martensitic 
stainless stee] somewhat embrittled at the 900 F tempering tem- 
perature. Steel G was a hot-work die steel. Both steels and 
their heat-treatments are described in Reference [8]. The dashed 
lines in Fig. 6 are considered equivalent to a 0.2 per cent offset 
tensile yield stress. However, precise yield stress measurement 
was not of central interest in this work. For steel B a single yield 
stress-temperature relation is shown for all three sheet thick- 
nesses. For steel G the measurements suggested the yield stress 
of the 0.10-in. sheet exceeded that of the two thinner sheets 
through out the range of testing temperatures. 

Using Equation (14) and the Srawley-Beachem curves for frac- 
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ture appearance as a function of temperature, values of S,, were 
calculated on the assumptions that a had the fixed value 1.0 for 
the 50 per cent shear fracture point and the fixed value 0.7 for 
the 100 per cent shear fracture point. In view of the steepness 
of the curves in this region it was evident this portion of the frac- 
ture mode transition corresponded to a smaller change of a. 
However, it was felt the results of the computations might serve 
both to bracket the range of probable ,, values and at the same 
time would indicate the uncertainty in estimates of this type. 
The results are shown in Fig. 7. Also shown in Fig. 7 are measure- 
ments of §, from centrally notched spin-disk tests reported by 
Winne and Wundt [6]. The selected spin-disk results are for a 
low fracture strength rotor steel. Because the 6-in. thick disks 
were equivalent to large values of a, the correction of these re- 
sults to G;, values would merely lower all of them by a small con- 
stant fraction, about 30 per cent, and would not change the trend 
of the results with testing temperature. 
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Fig.6 Critical thickness for transition to 100 per cent shear fracture as a 


function of temperature for two steels. Upper curves show tensile yield 
stress. (Data from Srawley and Beachem.) 
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Fig.7 Trend of SG). with temperature for several steels 


The trends with temperature of the values of G,, calculated from 
Equation (14) are quite similar for steels Band G. From the close 
correspondence of these results to the trend of S;, with tempera- 
ture for rotor steel TC it is believed estimates of §;, from Equa- 
tion (14) combined with observations of fracture mode transition 
will be of considerable interest. 

Measurements of G, were made on two thicknesses of Vascojet 
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X-3 which is similar to steel G. Yield stress was not measured. 
From the hardness and experience with similar materials the ten- 
sile yield stress should be close to 225,000 psi. The results of these 
tests in terms of values of G,, are indicated in Fig. 7 by the symbol 
G’. One set of test plates were thick enough, 0.130 in., so that 
the adjustment, using the equation under Table 1, from &, to S,. 
was small. The Table 1 adjustment equation is an arbitrary 
choice of Equation (16) which fits the low , region of the fracture 
transition for 7075-T6, a low strength rotor steel studied by Winne 
and Wundt, and a rotor steel studied by Wessel. Although the 
range of validity of this ¢,-to-S,. adjustment equation is not 
well established, the a values in Table 1 nevertheless appear to 
be in fair agreement with the degree of fracture mode transition 
as indicated by per cent shear. 

The accuracy with which one can calculate G;, from observa- 
tions of the fracture mode transition either with change of plate 
thickness or change of temperature depends upon the accuracy 
with which a selected feature of the fracture transition can be 
characterized by a value of the parameter a. From available 
data the change of fracture appearance from 40 per cent shear to 
95 per cent shear does not span more than a factor of two change 
of @ value. Thus the error of the @ estimate should be sub- 
stantially less than a factor of two. With some refinements an 
accuracy comparable to that of S, type fracture strength measure- 
ments, say 10 to 20 per cent, might be realized. 

In the case of steels of low hardness and large dynamic increase 
of yield stress direct experimental measurement of G,, is difficult, 
particularly for dynamic loading conditions, and may have no 
more accuracy at present than the indirect procedure using Equa- 
tion (14) together with observation of fracture transition and 
dynamic yield stress. The Equation (14) procedure also merits 
consideration for estimating the flat-tensile fracture strengths of 
high yield strength sheet steels used in construction of rocket en- 
gine chambers. Direct measurements of G,, for these steels are 
inconvenient because of the small sheet thickness. However, 
the S;. values are of interest. The spread of a surface crack 
through the sheet thickness under plane strain conditions has 
been noted as the starting phase in a number of instances of frac- 
ture failure of solid propellant engine chambers. Therefore both 
as an aid to inspection and to fracture failure analysis it is de- 
sirable to know the crack size which will propagate through the 
sheet thickness under various tensile stress loads. 


Conclusions 


The usually observed abrupt change from brittle to ductile 
fracture with increase of temperature as well as the less com- 
monly studied change of fracture mode with plate thickness can 
be ascribed to the unstable growth tendency of the plastic zone 
at the leading edge of the crack when conditions necessary for a 
crack edge stress environment of plane strain in central portions 
of the plate thickness are not present. From this viewpoint the 
fracture mode transition is controlled by a relative plate thickness 
parameter a which measures approximately the ratio of the sec- 
tion thickness to the size of the crack edge plastic strain zone. 
Knowledge of values of a characteristic of selected readily ob- 
served features of the fracture within the region of a fracture 
mode transition permits an estimate of the value of S, for com- 
pletely plane strain fracturing. 
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DISCUSSION 
Frank A. McClintock? 


Dr. Irwin has shown clearly how the transition from a plane 
strain to a plane-stress mode of fracturing can be related to 
the ratio of the plate thickness B to the critical radius of the 
plastic zone for the onset of crack instability under plane-strain 
conditions R;,., as indicated by his Equation (16). But as he 
noted, tests at one thickness do not give the notched strength at 
all thicknesses. This is not surprising, for where there is a change 
in the metallurgical mode of fracture as in the case of mild steel, 
for example, one would not expect the relative notched strength 
in the two regions to bear the same relation as they do for ma- 
terials such as the aluminum alloys which do not seem to have a 
metallurgical fracture transition. Thus the plane-stress and 
plane-strain regions should be considered separately in trying to 
estimate the notched strength from more directly observable 
properties, say, by analogy with the instability of shear cracks for 
which an analysis is available [9]. That analysis dealt with an 
elastic, perfectly-plastic material of yield strain in shear y,, which 
was assumed to fracture when a strain y, was attained through- 
out a region of size p, within which the plastic deformation could 
not be considered homogeneous. The critical radius of the plastic 
zone for instability of the crack R, was given approximately by 


R, = pexp [(2 y,/y¥,) — 1)’ (17) 


The corresponding value of the nominal stress was given in terms 
of the yield stress in shear k, the crack length c, and the radius 
of the plastic zone R, by the approximate equation 


J 
= ((1 + R/c) — aw 
K R\? 
(14 *) 


At low stress levels this relation, plotted in Fig. 8 of this discus- 
sion, approaches the familiar dependence on the reciprocal of the 
square root of the crack length. 

To establish an analogy between the plane stress and the shear 
solutions, note that in the plane-stress case one would expect to 
find elastic, plastic, and necking regions such as shown in Fig. 9. 
In the plastic region one might assume that there was just enough 
workhardening to offset the decrease in thickness until some criti- 
cal value of the strain was reached, at which point necking would 
begin. This critical value of the strain can be termed the uniform 
strain €, and, while it depends on the ratios of the principal 


(18) 


? Professor, Massachusetts Institute of Technology, Cambridge, 
Mass. Assoc. Mem. ASME. 

*’ Numbers in brackets designate additional References at end of this 
discussion. 
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stresses, as a first approximation it might be taken to be the uni- 
form strain in a tensile test on a thin sheet. Since the necking of 
thin sheet contributes little to its over-all elongation, the uniform 
strain may be taken to be the elongation of sheet material in the 
ordinary tensile test. In order to permit the necking to go to com- 
pletion and form a crack it would be necessary for necking to 
occur over an area of the order of the thickness of the specimen. 
Since the necked region could not be handled with the plane- 
stress equations used in the rest of the plastic zone, it might be 
assumed that necking would be possible if the equations indicated 
that the uniform strain €, was exceeded over the area which was 
expected to neck, namely, a region whose size is approximately the 
thickness of the specimen. Since complete necking is not neces- 
sary before a shear fracture, a still better approximation might be 
to require that the uniform strain be exceeded in a region whose 
size is the thickness of the specimen multiplied by the reduction 
of area at fracture in sheet specimens in which the necking was 
constrained to occur in a direction normal to the axis of maximum 
principal stress, as for example Fig. 10. 

Finally, in approximating a material which strain hardens, the 
analog of the yield stress in shear should be the tensile strength 
of a flat strip and the analog of this yield strain should be the ten- 
sile strength divided by Young’s modulus. Thus the quantities 
Pp, and y, become 


€u p — B(R.A.), TS./E (19) 


When these equations are substituted into Equations (17) and 
(18) for the case when the crack length is long enough so that in- 
stability occurs at a small fraction of the tensile strength, the 
following equation is obtained 


For very thin foils the structural-size parameter obtained from 
the foil thickness times the reduction of area may become smaller 
than a structural-size parameter set by the grain structure or 
other properties of the materia] itself [9]. In this case the struc- 
tural size is taken from observed inhomogeneities in the deforma- 
tion of the materia] rather than from the foil thickness. 


stress, 


Applied stress 


c 
Crack length / Rodis of plostic zone, 


Fig. 8 Dependence of radius of plastic zone on applied stress 
| R | 

Plastic 


Necking 


Fig. 9 Regions of deformation around a crack 
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Fig. 10 Tensile specimen to produce necking normal to axis of tension 


In order to check the validity of Equation (20), tensile tests were 
run on strips of 7075 T6 and 2024 T3 sheet, both straight and 
constrained to neck norma! to the axis as shown in Fig. 10. For 
comparison with these results, the stress for instability was also 
calculated from Fig. 4, assuming that G,/a is constant for speci- 
mens as thin as, or thinner than, the thinnest indicated there, and 
from the equation 


c= V/. EG./me = Vv. Ea(G./a)/re 
(21) 
oy 


c 


The results are summarized in Table 2 and, while the agreement 
is by no means exact, it does appear good enough to suggest 
similar calculations and experiments with other materials, as well 
as a more detailed analysis of the elastic-plastic plane-stress 
problem. 

For very thick plates the problem approaches that of fracture 
under plane-strain conditions. In the limiting case the thickness 
of the plate is itself no longer important, as the author has 
pointed out, and the structural-size parameter p should be asso- 
ciated with the structure of the material itself. In work on 
notch sensitivity of 7075 T6 under torsion, Mackenzie [10] found 
that crack initiation from notches of various root radii could be 
correlated by taking the structural-size parameter to be 0.010 in., 
or about 5 grain diameters, and the fracture strain y, to be that 
attained in the unnotched torsion test, or about 0.7. Due to in- 
creased triaxiality in this case the fracture strain might be as- 
sumed to be considerably less, It can be estimated by taking 
the structural-size parameter p to be 0.010 in. as in the case of 
torsion and by calculating the required values of €,, from G,, ac- 
cording to the following equation, obtained by analogy from 
Equations (17), (18), (21) 


TS. 
= + 1). (22) 


It is found that €,, is 0.03 for the 2024 T3 and 0.016 for the 7075 
T6. These values of the strain are far below those encountered 
in the uniaxial tensile test and to see whether they have a real 
meaning it would be necessary to calculate the state of stress 
around a notch under plane strain and to devise some convenient 
test in which such stress and strain distributions could be simu- 
lated without going to the bother of testing an entire plate. An- 
other possibility is to test a singly grooved plane-strain tensile 
specimen. A test was carried out on a bar of 1 X 3-in. cross sec- 


- tion with a '/;-in. deep X 0.0006-in. root radius groove cut across 


the face, leaving a net area 1/2 X 3 in. Failure occurred at a 
nominal stress of 25,600 psi which, considering the tensile strength 
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Table 2 
Material 2024 T3 7075 T6 
Thickness, B 0.125 0.050 
Uniform strain in unnotched strip, 
&u 0.17 0.13 
Reduction of area in constrained 
strip, R. A. 0.06 0.17 
Tensile strength of unnotched strip, 
. psi. 70,000 83,000 
Modulus of elasticity, 2, psi 107 107 
¢ Vc/B from Equation (20) 340,000 340,000 
o Vc/B from author’s data and 
Equation (21) 270,000 290 ,000 


of 86,000 psi in the form of a '/;-in-diam tensile specimen and 
taking a structural-size parameter to be 0.010 in., gave a value of 
fracture strain €,, from the analog of Equations (17) and (18) of 
0.012. The agreement between this and the value of 0.016 ob- 
tained from the paper may be coincidental but does bear looking 
into. In particular, as the fully plastic-stress level is approached, 
the load-carrying capacity of a singly grooved specimen ap- 
proaches that of a plane-strain specimen in the fully ductile case, 
whereas the load-carrying capacity of a doubly grooved specimen 
approaches (1 + 2/2) times that value. Thus the analog of 
Equation (18) may not hold at high stress levels. In addition, a 
certain amount of bending was present in the singly grooved bar, 
which was pulled between grips 68 in. apart. The bending effect 
varies with stress level. 

If the plane-strain and the plane-stress regimes of operation can 
be determined from simple tests on the material as described 
in the foregoing then the transition between these regimes may be 
described in a way similar to that proposed by the author, except 
that two parameters, the structural size p and the fracture 
strain €,, must be described as the specimen thickness increases 
relative to the radius of the plastic zone. When the thickness of 
the specimen is very great compared to the radius of the plastic 
zone, the plastic region will prevent contraction and local necking. 
As the specimen is thinned down to the point where the elastic 
region ahead of the plastic zone begins to offer less constraint, one 
might at first think that the unyielding material either side of 
the originally cut notch would itself prevent contraction and 
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Fig. 11 Hypothetical dependence of structural size p and fracture 
strain €; on plate thickness 
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maintain a plane-strain situation. But as the crack grows, stably 
at first, this original undeformed material is left behind and neck- 
ing can occur as readily as it would in the usual tensile test of 
a straight strip. For materials where fracture occurs before 
necking, there would be no change in the structural-size parameter 
from the plane-stress to the plane-strain condition. For materials 
which do neck, however, the transition of the parameters for dif- 
ferent materials might lie on curves such as those shown in Fig. 
1l. For convenience and ready reference the plate thickness is 
nondimensionalized in terms of the radius of the plastic zone in 
the plane-strain condition R;, found from the analog of Equation 
(18) 


Ri for (¢,,/TS.)< (23) 
The thickness ratio is related to the author’s thickness parameter 
by 


B/R,, = (24) 


The relations discussed in the foregoing give a more general form 
for the dependence of notched strength on thickness than given 
by Equation (16). That equation, however, set the stage for 
this discussion, which it is hoped will provide a guide for further 
work on the notch sensitivity of sheets and plates. 
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The author is indebted to the discusser for his thoughtful com- 
ments. The paper was intended to be essentially descriptive 
rather than theoretical. In this as well as in previous papers the 
author’s working plan has been to develop procedures for giving 
quantitative characterization to major descriptive features of 
fracturing. The mathematical model of crack-extension in- 
stability established by McClintock [9] should possess a degree of 
correspondence to these major features particularly to the relative 
plastic zone size. However, the mathematical model is a sub- 
stantial oversimplification of the actual instability process. 
Thus the task of establishing correspondence to the observations 
reported in the paper required careful consideration by the dis- 
cusser. The relationship of certain features of the paper to the 
problem solved in reference [9] was skillfully indicated by the 
discusser. Although the complex problem of crack-extension 
instability remains in several respects beyond reach of current 
analytical theory, the degree of correspondence between the 
McClintock’s mathematical model and observations reported in 
the paper suggests McClintock’s analysis is a useful first step 
toward a more complete theoretical analysis of the general prob- 
lem. 
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| Introduction 


‘ LANE PLASTIC DEFORMATION in metals can be 
measured from the corresponding Moiré effect: If two screens of 
slightly crossed equidistant lines are superimposed, interference 
patterns, called Moiré fringes, appear where the lines of the first 
screen intersect those of the second. Shepard and Weller [1}! 
used this method to study deflections in transparent beams. 
J. D.C. Crisp [2] measured elongation in transparent tensile test 
specimens with the aid of a special Moiré interferometer. Ligten- 
berg [3] determined the bending moments in plates with the aid 
of Moiré pattern obtained by superimposing, on the same nega- 
tive, the reflected image of a ruled screen which the model rejected 
before and after a suitable load was applied. A work by Thiry 
and Dantu [4] on the application of the Moiré effect in the study 
of plastic deformation has recently come to the notice of the 
authors. 

If one draws fine, regularly spaced lines on an undeformed metal 
test specimen and also on a transparent screen and then deforms 
the testpiece into the plastic range, Moiré fringes are formed 
when the transparent master grid is superimposed upon the de- 
formed grid. This provides the basis from which the amount of 
plastic deformation may be calculated. This method has the 
advantage over the photogrid method in that the measurements 
are not so tedious, that the regions of particular interest are 
clearly shown, and that pictures for measurement can quickly 
be taken at any stage of the plastic flow. It should be possible 
to measure elastic as well as plastic strains by using sufficiently 
fine screens. 


il The Grid 


A first attempt to obtain a grid having regular spacings of 
about 0.2 mm was made by drawing lines directly on the polished 
surfaces of a mild-steel test specimen with a sharp chisel. A 
similar set of lines was scribed on a sheet of plexiglas. It was 
found that this method was not suitable for accurate work be- 
cause of irregular spacing and small line width (0.05 mm) which 
resulted in irregular and poorly visible Moiré fringes. 

Better results were obtained by photoengraving the lines on the 

1 Numbers in brackets designate References at end of paper. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29-May 
3, 1959, of Tae American Society OF MECHANICAL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 


of the Society. Manuscript received at ASME Headquarters, Febru- 
ary 24,1959. Paper No. 59—Met-7. 
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Use of the Moiré Effect to 
Measure Plastic Strains 


A method is described which should prove useful in the study of strain distribution in a 
metallic member during plastic deformation. Regularly spaced lines are drawn on a 
test specimen of interest and on a glass plate. 
photographed through the grid of lines on the glass. 
results in an interference pattern from which the amount of plastic deformation may be 


After exposure to strain the part ts 
The strain distortion on the part : 


metal specimen. This technique has been described by J. A. 
Miller [5]. A 6 X 6-in. Ronchi ruling negative with 133 lines 
per inch was successfully used. This produced consistently 
clean-cut lines, with a spacing error of about 0.2 per cent of the 
nominal value. Superimposing the original negative on the 
deformed photoengraved specimen provided sharp Moiré fringes 
which were easily photographed and analyzed. 


Strain Analysis 


The photoengraved specimen is ‘deformed under load in the 
plastic range as desired. At each incremental deformation step, 
pictures can be taken of the Moiré fringes after superimposing 
the master screen on the specimen, with the lines of the screen 
parallel to the original direction of the lines on the undeformed 
specimen. Strain measurements can then be made from the 
prints. To do this, a convenient co-ordinate system is chosen 
and co-ordinate lines are drawn on the pictures. The distance be- 
tween the interference fringes are measured along these co-ordi- 
nate lines in both directions. The strains and rotations of the 
deformed body are then derived as described for the following 
three conditions: 

(a) Strain perpendicular to the screen lines. A model, strained 
uniformly in tension in a direction perpendicular to its screen 
lines, with the master screen superimposed onto it without rota- 
tion, is shown in Fig. 1. 

To obtain the necessary interference fringes, every nth line of 
the undeformed master screen must interfere with every (n — 1)th 
line of the elongated specimen grid or with every (n + 1)th line ° 
of the compressed specimen grid. If we call a the distance 
between two neighboring screen lines, and d the distance be- 
tween two neighboring interference fringes, Fig. 1, the conven- 
tional elongation e is then expressed by 


na =d =(n — lja(l +e) 


hence e= 


In the case of compression 


d+a 


For small strains (a< d) this can be written: 


a 
= +— 
d 
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Fig. 1 


Elongation perpendicular to the screen lines 


a_ Lines perpendicular to load direction 
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Fig. 3 Compression test on ring 


It is clear that, in this case, the interference lines are parallel to 
the ruling and that the distance d must be measured perpendicu- 
larly to that direction 

It is not always possible to determine at once in every point 
whether the strain is elongation or compression. If, however, 
the sign of the deformation is known at one point, it is possible to 
determine the state of deformation in all neighboring points, be- 

cause, in a homogeneous material, the deformation cannot show a 

discontinuous transition from elongation to compression. The 


Test Specimen interference Pottern 


Fig. 2 


Rigid-body rotation of one screen 


b Lines parallel to load direction 
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sign of the deformation in this point can now be found by measur- 
ing with a microscope the distance between two or more lines. 
However, this sign is usually known a priori in at least one point. 
For instance, from the nature of the problem, it is known at the 
point of application of load. 
"  (b) Rigid body rotation of one screen. If the master screen is 
slightly rotated with regard to the undeformed screen on the 
specimen, interference lines appear, Fig. 2. 
If we call h the distance between two neighboring interference 
lines, then 


h 
tan @ 2 


in @ = 
h 


For small rotations this can be written 


The rotation-interference lines are slightly inclined (about a/2) 
on the grid and the distance h must be measured parallel with the 
screen lines of the test specimen. 


a Lines perpendicular to load direction 


(c) Homogeneous deformation. In order to determine, in plane 
strain, both principal strains €¢ and €, two orthogonal sets of 
screen lines are to be provided on the surface of the specimen. 
If we take orthogonal axes OX and OY, the screen lines on the 
master grid being parallel to OX, then the interference fringes 
are the contour lines of the displacement function v(z, y). It 
follows that, for small strains, 


oe 
oy 
if d, is the y-distance between two interference fringes, and 


h, 


if h, is the z-distance between two interference fringes. 

Similarly, the interference fringes of a grid parallel with the Y 
axis are the contour lines of the displacement u(z, y), from which 
it follows that 


b Lines parallel to load direction 


Fig. 4 Tensile specimen with hole 
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hence 
a 
as=-— 
h 
ou a 
oz d, 
eit if d, is the z-distance between two interference fringes, and 


= 
dy h, 
if h, is the y-distance between two interference fringes. 
We are thus able to determ'ne the three strain components ¢,, 
e,, and y,, = @, — a, for a homogeneous plane strain. The 
principal strains €, and €, and their direction @ are then given by: 


= “Valle, + e,) V(e, — + Yey'l 


and 


tan 20 = =. 


If the strain is not homogeneous, these formulas give the average 
values of the strain components in a small area, limited by the 
interference fringes. 

A more rigorous mathematical treatment of the plane homo- 
geneous deformation is given in the Appendix. 


IV Results of Tests 
A Compression Test on a Ring 


A mild steel ring, 75.0 mm outside, 20.0 mm inside diam, 
23.0 mm thick, and loaded in compression was the first specimen 
used to test the technique. 

A set of equally spaced lines, 0.2 mm apart, were scribed on the 
polished surfaces with the help of a small dividing machine. One 
set of the lines was in the load direction, the other one perpendicu- 
lar to it. The specimen was placed on a carriage and moved for- 
ward with a micrometer screw in stepsof0.2mm. A cutting tool, 
fixed on a sledge, drew a very sharp and regular trace, +0.05 mm 
wide, on the surface of the specimen. The spacing a between two 
lines, however, sometimes showed slight deviations, which resulted 
in an irregular Moiré pattern. 

After a compressive load of 36,800 Ib the interference fringes 
were obtained by superimposing a transparent screen with lines 
0.1 mm apart on the test specimen, Fig. 3(a and b). 

The picture shows that the deformation in the ring under the 
contact points is compression of high magnitude, accompanied 
by shear of two wedge-shaped volumes inward to the hole. 


B Tensile Test on a Mild Steel Bar With a Central Circular Hole 


The mild steel specimen used in this test, Fig. 4, was machined 
from a large hot-rolled '/;-in. plate. Its dimnensions in inches are: 
width, 3.480; thickness, 0.470; radius of hole, 0.314. The 0.2 
per cent yield strength, measured on a cylindrical test specimen, 
was 28,900 psi. 

Before the test, both surfaces were polished and provided with 
a 133 lines-per-inch ruling photoengraved onto the surface. One 
side had the lines perpendicular to the axial load, the other side 
had the lines parallel to the load direction. 

The bar was pulled in tension up to 58,200 lb (43,400 psi average 
on net section) and the corresponding interference pictures were 
taken, Fig. 4. 

A square grid was drawn on the prints and the distances be- 
tween the interference lines were measured in the z and y-direc- 
tions along these lines. These data permitted the calculation of 
the average strain between two interference lines, shown by the 
stepped line graph, Fig. 5. These original curves, when averaged 
to one quadrant, yielded the curves of equal elongation e,, Fig. 6, 
and equal compression ¢,, Fig. 7. The strain concentration, 
along z = 0, is defined by: 


e, at the edge of the hole 


0.250 
= 
Average e, across minimum section 


0.066 


This value may be much larger for smaller hole radii, and can 
be highly significant in the performance of strain-sensitive metals. 
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X- Direction 


Fig. 5 Percentage elongation in x-direction along co-ordinate lines A 


end D 


Fig. 6 Plastic deformation of tensile specimen with hole. Percentage 
elongation in x-direction. 
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Kawata [6] in his work with photoelastic coated specimens ob- 
tained a similar strain distribution for the case of a flat bar with a 
hole. 

It is interesting to note the distribution of the maxima in e, 
and e, on each side of the longitudinal center line of the bar. 
They reflect clearly the existence of a cross-shaped depression, 
that often is found in largely deformed flat tensile bars. 

Fig. 8 shows the variations of e,, e,, ande,forz = 0. Here, e, 
is measured by the reduction in thickness along r = 0. The 
constancy of volume is expressed by: 


=1 


We can calculate from this equation the value e,* by substitu- 
tion of the measured values e, and e,. The calculated e,* and 
the measured ¢,-values are shown in Fig. 8. Both values agree 
fairly well. 


C_ Tensile Speci With Notch 


A flat tensile specimen in mild steel had the following dimen- 
sions (in inches); width 2.000, thickness 0.052, notch radius 
0.0625, notch depth 0.1875. Before the test, one surface was 
polished and a 133 lines-per-inch screen photoengraved onto it 
The lines were perpendicular to the load direction. The speci- 
men was loaded until a crack appeared at the right notch; at this 
moment the load was 4800 lb or 51,900 psi average on the net sur- 


Load 


Fig. 7 Plastic deformation of tensile specimen with hole. 
elongation in y-direction. 
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Percentage Fig.8 Variations of ex, e,,¢:, and e,* along x = O in tensile specimen 
with hole 


face. The interference picture of the deformation is shown in 
Fig. 9. The strain curves along vertical lines drawn on the print 
were derived from the spacings between the interference fringes 
along these lines. These curves permitted us to plot the lines of 
equal e, elongation on the specimen, Fig. 10. 

An interesting feature of this strain distribution is the low 
deformation region at the center of the specimen, which is 
bounded by areas of higher elongation. 
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APPENDIX 
Strain Components «,, «,, y 


We have shown how we can visualize the plastic deformation in 
a body and how we can deduce the principal strains in the case of 
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an infinitesimal deformation. In the case of a finite homogeneous 
plain strain, the second-order terms can no longer be neglected 
and other formulas are to be used. 

We provide two orthogonal sets of screen lines on the surface of 
the body before deformation, one in the z-direction and one in 
the y-direction, Figs. 11 and 12. The Moiré pattern is obtained 
by using a fixed screen having a similar set of screen lines parallel 
with the z and y axes, which are fixed and do not follow the de- 
formation. 

From the Moiré fringes one measures the following quantities: 


= y-distance between two neighboring fringes 
a-distance between two neighboring fringes 
angle between Oz and fringes 
x-distance between two neighboring fringes 
y-distance between two neighboring fringes 
angle between Oy and fringes 

The real strain is: 
OB’ — OB OA' —OA 


while we may call “apparent strain,’’ the quantities: 


OB" — OB. OA” — OA 


OA 


The apparent strain e, and e, are calculated from the measured 
quantities d, and d,. 


Fig.9 Tensile specimen with notches 
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Fig. 10 Plastic deformation of notched tensile specimen at the moment 
of fracturing. Percentage elongation in x-direction. 


In the case of extension 


and = 


d,-—a 


d,-a 
In the case of compression 


d,+a 


—a 
and ¢ = 
The unit shear [7] in point O, Fig. 13, is defined by: 
y = tan y = tan (a, — a,) 


This can be calculated as follows: 


Consider the triangle RST, Fig. 11. We obtain: 


tan a, tan B, 


tan a, = 


In the case of extension we have: 
h, = = (6) 
From equations (3), (5), and (6) it follows that, in any case: 
tana, = —e, tan 8, (7) 
Similarly, we obtain: 
tana, = —e, tan 8, (7a) 
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Fig. 11 
Screen lines originally parallel with the x-axis. 


Moiré fringes caused by a finite homogeneous deformation. 


When we have calculated e,, ¢,, a,, and a@,, we possess all the 
elements we need to construct the rhomboid OA‘C’B’ obtained 
by distortion of the square OACB, Fig.13. Itis therefore possible 
to calculate €,, €,, Y and the rotation w from these four quantities. 

Let us assume that the transformation from OACB to OA‘ C’ B’ 
is brought about by the linear vector function B so that the new 
co-ordinates x’ are given 


x’ = Bx (8) 
Ly’ Wom bal” 


This means that 
2’ = bux + and 
y’ = bar + bay 


The co-ordinates of point B” are obtained by substituting z = na, 
y’ = Oin (9), (OB = na). 


bubs — dubs |B] 
bea baa 


If we express OB” as a function of the apparent strain, we find 


(9) 


OB’ = (10) 


1+e 


and in the same way, we find 


(11) 
l+e= 
The co-ordinates of point B’ are found by putting y = 0 in equa- 
tion (9). 
= bur 
y’ = but 
Then 


tan a, = = 
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Fig. 12 Moiré fringes caused by a finite homogeneous deformation. 
Screen lines originally parallel with the y-axis. 


Ox , Oy fixed orthogonal axes 
OX, OF directions of principal strains in the undeformed axes 
OX/, OY;' directions of principal strains after deformation 


Fig. 13 Finite homogeneous plane deformation 


and similarly 


The equations (11) and (12) yield 


The real strains €, and €, can now be calculated in terms of bu, 


biz, ber, and bee. 


y 


(12) 


—bis 


l1+e, 
1 + tan a, tan a, 


bu = 


1+«¢ 
1 + tan a, tan a, 


be = 


(13) 


—tan a,(1 + ¢,) 
1 + tan a, tan a, 
tan a,(1 + e,) 

1 + tan a, tana, | 


bu = 


ba = 
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The co-ordinates of point B’ are obtained by putting z = na, 
y = 0 in equation (9) (OB = na). 


= buna 


y’ = buna 


OB' = V bu? + ba? na 


The real strain €, is obtained from 
(1 + ¢,) ( 0 
Similarly 
(1 + €,)? = + 


Substitution of equation (13) in (14) yields: 


1 + tan* a, 
(1 + tan a, tan a,)? 


(1 + e,)? = (1 + e,)? and 


1 + tan’ a, 
(1 + tan a, tan a,)* 


(1 + ¢,)* = (1 +¢,)* 


1+e, = (1 + e,) and 


cos (a, — a@,) 


cos 


If only one grid of parallel lines is scribed on the specimen, the 
Moiré pattern allows us to calculate, for instance, e, and a,, but 
not €,. 

The Moiré fringes can be brought nearer to each other, when 
we give the master screen a rotation a with respect to its original 
parallel position. A rigid body rotation: w = a,¢, = ¢€, = y = 0, 
results in an apparent strain e,’ deduced from (16): 


(17) 


08 
If the body afterward undergoes an additional homogeneous 
plane deformation, then the values €,, €,, and ‘y, can still be cal- 
culated from (16) and (4). When only one set of parallel lines 
is drawn, the measured apparent strain ¢,” can be corrected with 
the additional e,’ by taking: 


= — €,' (18) 


Principal Strain-Components and Directions 


The transformation expressed by the linear vector function B 
can be separated into a rigid-body rotation R and a pure strain 


B = RC (19) 


—sin w 


cos 


cos len 


Cas 


bu 
bu 


\|sin w 


with cy = Cy. 
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This requires: 
ba — bis 


tan = 
bu + bx 
or from equation (13): 


(1 + e,) tan a, + (1 + ¢,) tan a, 
(1 + e,) + (1 + ¢,) 


tanw = 


We thus find the numerical value of the rotation. 

The symmetric linear vector function C, expressing a pure 
strain without rotation, allows us to calculate the principal strain 
direction and the principal strains ¢€, and 

If we change from the fixed orthogonal axes zOy to the fixed 
orthogonal axes XOY, Fig. 13, the angle between Oz and OX 
being 6, the linear vector function B is replaced by B’, given by: 


B’ = UBU* (22) 
with: 
cos@ —sin 


U = , U 


= | cos sin 
isin 8 


|—sin cos 

One verifies, with the aid of the first formula (21), that this leaves 

the rotation w unchanged, and that 
B’ = RC’ 

where (23) 


Cc’ = UCU* 


R is the same as in (19) and U is the same as in (22). 
to choose U (or @) such that, in the new axes XOY: 


It is possible 


If this is the case, the corresponding directions OX and OY are 
called principal directions. For the components of the symmetric 
linear vector function C, (4) and (13) require that: 


c's = c'n = 0 


Substituting these values into the second equation (23) one finds: 


tan 20, = 
Ca Cu 


(24) 


Solving (19) for the components of C, and substituting this in 
(24): 
babu + 


tan 26, = 2 
—bu* + + — 


and finally, from (13): 
(1 + + e,(tan a, — tan az) 
(1 + e,)%1 + tan* a,) — (1 + e,)%(1 + tan’ a,) 
(25) 


The direction corresponding to 0 < 6; < 7/2 is the first principal 
direction OX,, Fig. 13. 
The principal strain €, can be deduced from (14): 


1+ 6 = cy’ 


by calculating cy’ from (19) and (23). The result is more readily 
obtained by considering the triangle OQB, transformed into 
0Q'B’, Fig. 13, and writing: 


OQ"? = OB" + — 20B'-B'Q’-cos OB'Q’ 


tan 20, = 2 


(26) 
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(21) 
4 
(14) 
| 
(15) 7 
or: 
or: 
! 


OB? BQ? 


Observing that = 


00 = 


and dividing both sides of (26) by this quantity, one obtains: 


(1 + &)? = (1 + €,)? cos? 0, + (1 + €,)? sin? A 
+ 2(1 + €,)(1 + €,) cos 0, sin @; sin (@, — a,) (27) 


The principal strain €, in the direction OY, is found to be: 
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(1 + = (1 + €,)* + (1 + €,)? sin? 
— 2(1 + €,)(1 + €,) cos A sin 0, sin (a, — a,) (28) 
One readily verifies that: 
(1 + 4)? + (1 + &)? = (1 + €,)? + (1 + €,)* 


which replaces €; + €: = €, + €, which is valid for infinitesimal 
strain. 
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R. M. MAINS 


Consulting Engineer, Knolis Atomic 
Power Laboratory, Schenectady, N. Y.? 


A Generalization of Cumulative Damage 


In the application of various damage accumulation hypotheses to shock and vibration 


problems, a pattern of results was observed. A plot of allowable response (for the largest 
of a sequence of excitations) versus the number of excitations in the sequence produced 
straight lines on log-log paper for each value of damping. It seemed likely that a gen- 
eralization could be found which would relate the allowable response to the largest 
excitations with a load-distribution factor, a material-and-structure factor, and the 
material constants from fatigue tests. One form of this generalization is presented, to- 
gether with some numerical examples of its use, and a discussion of its possible sig- 


nificance. 


Introduction 


N THE COURSE OF SEVERAL APPLICATIONS of damage 
accumulation hypotheses to shock and vibration problems [1],* 
there seemed to be a consistent pattern for the results. In each 
case the results could be presented as a graph of the allowable 
maximum stress (or plastic strain) response versus the applied 
number of shocks (or vibration sequences) for various values of 
structural damping. Changes in the distribution of load am- 
plitudes shifted these curves asa group, also. As a consequence 
of these observations, it seemed possible that a generalization 
of damage accumulation could be made somewhat as follows: 


Total damage accumulated = (amplitude of maximum response 
factor) X (distribution of load amplitudes factor) 
X (material and structure factor) (1) 


If such a generalization could be made, then the writing of speci- 
fications and the design processes for shock and vibration could 
be greatly simplified and, at the same time, made more rational. 
It might even be possible to include factors for the damage con- 
tributed by other loads, such as thermal expansion, pressure 
cycles, mechanical loads, and the like. The following develop- 
ment leads to one form of such a generalization. 


Fatigue Test Data 


In order to begin, let us consider presently available relation- 
ships which could be put together to form the various factors in 
equation (1). First, it is reasonably well established [2, 3] that 
fatigue tests of steels give the relationship: 


1 Legal notice: This report was prepared as an account of Govern- 
ment sponsored work. Neither the United States, nor the Com- 
mission, nor any person acting on behalf of the Commission: 

(a) Makes any warranty or representation, express or implied, with 
respect to the accuracy, completeness, or usefulness of the informa- 
tion contained in this report, or that the use of any information, 
apparatus, method, or process disclosed in this report may not in- 
fringe privately owned rights; or 

(b) Assumes any liabilities with respect to the use of, or for damage 
resulting from the use of any information, apparatus, method, or 
process disclosed in this report. 

As used above, “‘person acting on behalf of the Commission”’ in- 
cludes any employee or contractor of the Commission to the extent 
that such employee or contractor prepares, handles or distributes, or 
provides access to, any information pursuant to his employment or 
contract with the Commission. 

2 Operated for the United States Atomic Energy Commission by 
the General Electric Company under Contract No. W-31-109 Eng-52. 

? Numbers in brackets designate References at end of paper. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29-May 
3, 1959, of Tue American Society or MECHANICAL ENGINEERS. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 
of the Society. Manuscript received at ASME Headquarters, Jan- 
uary 26,1959. Paper No. 59—Met-1. 


Journal of Basic Engineering 


2 


LOG. y (STRESS OR PLASTIC STRAIN) 


LOG N (NO. CYCLES TO FAILURE) 
Fig. 1 Fatigue diagram 


yN* = C; (2) 
as shown in Fig. 1, in which 


y = stress for stress fatigue or plastic strain for strain fatigue 
N = number of cycles to failure at y 

k = '/, for stress fatigue, or '/; for plastic strain fatigue 
C; = constant for each material 


Structural Response to Transient Loads 


Second, it is well established [4] that elastic structural responses 
to transient loading of a structure have the form: 


y = sin wt + yo(t) (3) 
as shown in Fig. 2, in which 


= stress or strain response 
= critical damping ratio 
= natural frequency of the responding mode 
= initial amplitude, a function of the load 
w/V1 
stress or strain response to steady load or slowly (com- 
pared to 27w;) varying load 


If y in equation (3) is not large enough to involve yielding of the 
material, then y, C2, and yo may represent either stresses or strains. 
If y does involve yielding, then Equation (3) is not exact because 
the rate of amplitude decay is likely to be faster than e ~*~ would 
provide. However, if the amplitude decay curve for y can be 
measured or calculated, then a value of fw» can be found which will 
satisfactorily approximate the decay curve. (A good rule of 
thumb is to take & as the reciprocal of 2 times the number of 
cycles required to decay to 5 per cent of the initial amplitude.) So 
for the purpose of this generalization, we will assume that y in 
equation (3) can also represent plastic strain responses. 
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Damage Accumulation 


Third is the matter of damage accumulation, which is not we!l 
established at all. The assumption of linear damage accumula- 
tion (Palmgren 1924, Serensen 1944, Miner 1945) usually called 
Miner’s hypothesis [5] has the form: 

n; 
in which 

N, = the number of cycles to failure at a particular stress or 

strain for the virgin material 

n; = the number of cycles applied at that particular stress or 

strain (equal to one for individual cycles) 
6 = the proportion of damage accumulated in a sequence of i 
groups of stresses or strains. 


Miner’s hypothesis was that failure occurred at 6 = 1, but more 
recent work [6, 7, 8] shows that failure occurs for values of 6 as 
small as 0.1 and as large as 10. The correct value to use for 6 de- 
pends upon a number of factors which are still being investigated, 
so we shall proceed with deriving the form of the generalization 
and determine the importance of a specific value for 6 when we 
see how it affects the results. 


Stress or Strain Range per Cycle of Response to Transients 


To put the available elements from equations (2), (3), and (4) 
together, consider the range of amplitude covered by each cycle of 
equation (3). In Fig. 2, this range is the difference between the 
positive peaks, (1), (3), and so on, and their succeeding negative 
peaks, (2), (4), and so on. Mathematically, this difference is the 


difference between 1, at wt = my (4r — 1) and 


Y—-. at wt = = (4r — 3). 


This is 
4r-3 ) 


2 
Ve = =e 


Cy sin ~ (4r — 3) r= 1,2,3... 


PER CYCLE DETERMINES § 


Fig. 2 Transient response curve 
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Ye = C2 + 


4r-—3 


Note that r is the number of the cycle of stress or strain in the 
particular series of stresses or strains resulting from the applica- 
tion of one transient load. 

In order to determine how many cycles in the series are of 
significance, consider the ratio of yg forr = land forr = 


Then for various values of &, 


gE 1/2 1/4 1/8 1/16 1/32 1/64 
yit/y: 0.0268 0.00470 0.00342 0.00263 0.00227 0.00207 


and it seems reasonable to say that carrying r out to 1/£ will cover 
the significant s'resses or strains in the damage accumulation. 


Damage Accumulation per Transient Load 


Now let us hold equation (6) in mind while we look again at 
equation (2), which can be rewritten as 


1/N = (y/C,)“* (8) 


If we consider further the portion of damage contributed by a 
single cycle in equation (4), 


6; (9) 


and substitute equation (8) into equation (9), we get: 
1 


(+) 


Now substitute equation (6) into equation (10) and obtain: 


4r -—3 
exp {| — Vi-# (11) 


as the portion of damage contributed in each individual cycle r of 
the damped train of response to a transient. If we agree to ignore 
the cycles after r = 1/£, then the sum of the damage contributed 
by the train of cycles of response to a single transient becomes: 


fs 
= 


This can be rewritten as: 
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Ye = exp{ —§ — 
4r—1 
; + exp (-¢ (5) 
2V1-# 
| 
t 1 1 
—rt = 
@ | r=1,2,3. 
27m, 


TO 


5, 


DAMPING RATIO- & 


1 


io"! 


10 
MATERIAL AND STRUCTURE FACTOR-M 
Fig. 3 


The term in brackets in equation (13) depends only upon &, k, 
and how many cycles r are counted, so that it is clearly a factor 
dependent upon the material and the structure. Let us then de- 
fine a material and structure factor M as: 


M = 


and consider it for use in equation (1). Fig. 3 shows a plot of M 
as a function of the critical damping ratio — and various values 
of k. 


Damage Accumulation From a Series of Transients 
Next, let us rewrite equation (13) as: 
1 


i, - u (2) (15) 


and recall that 6, is the portion of damage accumulated by the 
train of cycles of response to one transient. To determine the 
effect of a series of transients, we can now let C; take on a series of 
values, each of which is the initial response to a different transient. 
Then the sum of the damage accumulated in m such transients is 


1 
= MX), (16) 
1 


in which C2» is the initial response for the mth transient. This 
equation can be rewritten as 


1 
Con 
= Com) (17) 


in which Cmax is the largest initial response in the series of 
transients. Since Cmax/C; is a constant factor, it can be brought 
outside the summation to give: 


1 
Cmax Com 
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The part left within the summation is dimensionless, and depends 
only upon the ratios of the initial responses to the maximum initial 
response, the number of transients, and the constant k from equa- 
tion (2). Consequently, we can now think of the summation 
term in equation (18) asa load distribution factor D and define it 
as follows: 


and consider it for use in equation (1). 


Tabie 1 Load amplitude distributions for various total numbe:s of loads 


(Normal probability or Gaussian distributions) 


Intensity Number of loads in intensity interval for total 
number n 
n = 200 n = 500 n = 1000n = 2000 n = 5000 


0 44.3 122.1 258.5 545.8 1453.0 
10 40.9 110.1 232.2 482.6 1264.0 
20 35.0 91.1 187.0 380.6 961.5 
30 27.6 68.6 135.4 263.4 638.0 
40 20.2 47.0 87.8 164.8 369.0 
50 13.6 29.3 51.4 90.0 186.5 
60 8.5 16.6 26.8 43.4 81.5 
70 5.0 8.6 12.6 18.8 31.8 
80 2.6 3.9 5.3 7.2 10.9 
90 1.3 1.7 2.0 2.4 2.8 
100 1.0 1.0 1.0 1.0 1.0 


Effect of Factor D 


In order to provide some feeling for the effect of factor D, its 
value was computed for a series of normal probability (Gaussian) 
distributions of various total numbers of transient loads. The 
amplitude distributions for the various series are given in Table 1, 
and a plot of D versus total number of these transient loads for 
three values of k isshown in Fig. 4. (In addition, it is worth while 
to observe that, for a steady sinusoidal load, D becomes simply m 
while the material factor M becomes unity. For a load applied 
only once slowly and removed slowly, both D and M become 
unity.) 


Amplitude Factor 


Equation (18) can now be rewritten as: 
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as = 
| 
= 
1 1 
r@1,2,3...- 
C. k 
4-3 D= (=) (19) 
xpi (14) / max 
ry 


NUMBER OF OCCURRENCES 


— 


FOR GAUSSIAN DISTRIBUTION 
OF LOAD AMPLITUDES 4 


“4 


— 


io! 10 
LOAD DISTRIBUTION FACTOR, D 
Fig. 4 


|~ 


C k 
i, = xDXM (20) 

In this equation, Cmax is the maximum initial response, and C, 
and k are material constants from equation (2). This term is 
again dimensionless and qualifies as an amplitude of maximum 
response factor A defined as follows: 


1 


* 
A= (=) (21) 


This completes our search for terms to put in equation (1), which 
now reads: 


=AXDXF (22) 


Recapituiation 


To recapitulate before exploring further, let us examine equa- 
tion (22): 


1 The material-and-structure factor M is a constant, linear 
multiplier once the exponent k in equation (2), the damping & in 
equation (3), and the number of cycles to be counted in each 
transient response r have been determined. 

2 The load-distribution factor D is a constant, linear multi- 
plier once the number of separate transient disturbances m their 


2m 
shape distribution a -, and the exponent k in equation (2) are 


max 


determined. 
3 The amplitude-of-maximum-response factor A is dependent 


only upon the largest initial response Cmax and the constants C; 
and k are established in equation (1). 

4 The damage accumulation number 67 represents the same 
thing as 6 in equation (4), and should indicate failure for some 
value, presumably near 1. 


Inclusion of Other Kinds of Loads 


Equation (22) is then an answer to our quest as set forth at the 
beginning, and as defined verbally in equation (1). Suppose we 
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are concerned with several different kinds of excitation each acting 
at separate times, such as shock, vibration, thermal cycles, pres- 
sure cycles, and the like. Then equation (22) would become: 


Biot = (A, X X MY (23) 


in which the subscripts i denote the specific values of the various 
factors appropriate to each of the different kinds of excitation. 
We can go further by normalizing equation (23) and divide each 
A; by the maximum value A max which gives: 


A 
= Ames), (; Dy Xx us) (24) 
1 


Amex 


4 Cmax 
max 


from equation (21), equation (24) can be rearranged and solved 
for Cmax to give: 


Cmax = CrB rot" [= (25) 


Now since 


If we establish a limiting value for 1 in equation (25), the equa- 
tion then expresses the allowable maximum initial response Cmax 


thus: 
Ay 
= ( [= ry x D; x | (26) 


max 


which could be used as a design limit. 


Precautions and Significance of Factors 


In using equation (23) as an evaluation tool or equation (26) as 
a design tool, it should be noted that, if various kinds of loads are 
applied simultaneously, then their combination into cycles of 
various sorts must be done before the equations are applied. 
Such a procedure does not alter the validity of the generalization, 
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and it still provides a simplification over any other way known to 
the author to include the same considerations in an analysis. 
The validity of the generalization as such, however, must be 
checked against available service or test data before it can be 
accepted as correct. 

The significance of the various factors in equation (26) would 
seem to be as follows: 


1 The allowable total damage accumulation is a factor raised 
to the kth power. The significance of d51im in affecting Cusx may 
be judged by: 


k=0.1 
0.887 
0.927 
0.978 
1.000 


Siim* for— 
0.2 
0.717 
0.859 


0.956 
1.000 


0.5 
0.548 
0.707 
0.895 
1.000 


in which we see that Cusx is not particularly sensitive to d1im. 
This may help to explain why large variations in djim are found 
experimentally. 

2 Since the linear combinations of A,, D;, and M; are summed 
and raised to the —k power, and since & is less than 1.0, these fac- 
tors are magnified in importance. Hence their proper determina- 
tion is of considerable significance in establishing Cnax. 

3 The material constants C; and k are of primary importance 
in establishing Cusx, a8 they should be. 

4 Any term in equation (26) is eligible for the application of a 
safety factor, depending upon which uncertainties are to be so 
covered, and the resulting effect on Cusx can be evaluated from 
the equation. Safety factors applied to A,, D,, or M; are mag- 
nified in effect, while for dim the effect is diminished. Where 
such safety factors should be used is a matter for the individual 
problem. 


Effect of Mean Stress 


In reviewing this generalization, Dr. W. E. Cooper‘ pointed out 
that the effect of mean stress (or strain) on fatigue life had not 
been included, and suggested that it could be accounted for as 
follows: 

Mean stress (or strain) affects the fatigue life at a particular 
alternating stress (or strain), the same as if some different al- 
ternating stress (or strain) alone had been applied. That is to 
say that a combination of alternating stress (or strain) and mean 
stress (or strain) can be replaced by an equivalent alternating 
stress (or strain) which has the same damage effect as the com- 
bination. In equation (3), let C; be replaced by C,*, the equiva- 
lent alternating stress (or strain) with corresponding changes in 
Com in equation (16) and Cmax in equation (17). Then 


= S(Ymenn, Yo) (27) 
in which 

initial amplitude in equation (3) 

mean stress (or strain) parameter 

material parameter 


some function which produces the proper rela- 
tionship between C; and C,* 


C;* = equivalent alternating stress (or strain) 


S(Ymean, Ye) 


When the actual combined mean and alternating stresses stay 
within the elastic range, and the modified Goodman diagram [10] 


* Consulting Engineer, Knolls Atomic Power Laboratory, General 
Electric Company. 
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ENDURANCE LIMIT FOR 
ALTERNATING STRESS 
(STRAIN) 


LIMIT FOR ELASTIC ACTION 


ALTERNATING STRESS 


~~. 


“wp --- 


Ys 


MEAN STRESS —~> 
Fig. 5 Modified Goodman diagram for equivalent alternating stress 


Notes: Ordinates and abscissas must have same units. 


Area OAB can be stress or strain. 
Area outside OAB must be strain or “fictitious elastic stress." 


is used for defining the effect of mean stress on fatigue life, then 
equation (27) becomes (see Fig. 5): 


-1 
Vs 
in which 


Ymeen = actual mean stress 
C; = actual alternating stress amplitude 
Ys = ultimate stress for the material 


(28) 


When strains are used instead of stresses, equation (28) still 
holds, with appropriate interchanges of strain for stress in the 
definitions. In this case, the strains need not be restricted ‘o the 
elastic range, but may include plastic strains as wel. 

For cases in which the combined mean and alternating strains 
do not exceed 3 or 4 times the yield strain, then equation (28) can 
be used with the various parameters, particularly the mean stress 
parameter, defined as “fictitious elastic stresses” in the manner 
proposed by Langer [11]. 

In any of these cases, and whatever relationship is used to de- 
fine the effect of mean stress on fatigue life, the results in terms of 
this generalization are: 


1 Com and Cmax in the load distribution factor, equation (19), 
are replaced by C2,* and Cmax*, the equivalent alternating 
stresses (or strains). 

2 Cwax in the amplitude of maximum response factor, equa- 
tion (21), is replaced by Cmax*. 

3 Coax the allowable maximum initial response, equation 
(26), becomes Cusx*, the equivalent alternating response value. 


Numerical Illustrations 


To provide some numerical illustrations of this generalization, 
the following forms of equation (2) were taken from the literature: 
For carbon steel pipe [2] 


= 245,000 
For stainless steel pipe [3] 
e, N'/* = 0.35 
For butt welds in structural steel plate [9] 


(28) 


(29) 
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ao N‘/* = 150,000 (30) 


It was then assumed that it was desired to find the allowable 
maximum stress or strain response to a series of shock loads 
according to equation (26). In each case a normal probability 
distribution of shock load intensities was assumed, as in Table 1, 
with varying numbers of occurrences, and with various damping 
ratios. In addition, the allowable value of 6)im was taken arbi- 
trarily as 1.0 in keeping with Miner’s hypothesis. Other values for 
diim Would have the effect of multiplying the results for Cuax by 
the appropriate factor from the table in the section on Precau- 
tions and Significance of Factors. The results are given in Tables 
2, 3, and 4. 

These results show that, for 1000 total shocks and a structure 
and material with an effective damping of £ = 1/1, the allowable 
response to the maximum shock would be: 


82,300 psi for the material of equation (28) 
0.024 strain for the material of equation (29) 
77,000 psi for the material of equation (30) 


If dyim had been taken as 0.8 instead of 1.0, these values would 
have been reduced by 4.4, 10.5, and 2.2 per cent, respectively. 
With either value of dim, the allowable response to maximum 
shock would be much greater than the usual values associated with 
yield stress limits. 


Table 2 Results of applying equation (26) for a Gaussian distribution of 
shock loads to o X N'/s = 245,000, diim = 1.0 


No. of 
shock Dist. Damp. Material 
loads factor ratio factor 


200 6.82 1/2 0.0229 


1/32 31 
1/64 61 73,000 

500 11.14 1/2 0.0229 322,000« 
1/4 0.825 158,000 

1/8 4.98 110,000 

1/16 14.25 89,000 

1/32 31.3 76,000 

1/64 61.7 66 , 000 
1000 16.4 1/2 0.0229 298 , 000 
1/4 0.825 146, 

1/8 4.98 102, 000 

1/16 14.25 82,000 

1/32 31.3 67,000 

1/64 61.7 61,000 

2000 25.2 1/2 0.0229 273, 000¢ 
1/4 0.825 134, 000° 

1/8 4.98 93 ,000 

1/16 14.25 75,000 

1/32 31.3 65,000 

1/64 61.7 56,000 

5000 46.9 1/2 0.0229 242,000« 
1/4 0.825 98, 000 

1/8 4.98 82,000 

1/16 14.25 67,000 

1/32 31.3 57,000 

1/64 61.7 


* Should be treated as strain fatigue problem. 
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Table 3 Results of applying equation (26) for a Gaussian distribution of 
shock loads to €p X N’/t = 0.35, diim = 1.0 


No. of shock Dist. Damp. Material 
loads 


factor ratio factor 
m D M 
200 25.4 1/2 0.220 0.148 strain 
1/4 0.734 0.081 
1/8 2.27 0.046 
1/16 4.53 0.033 
1000 92.6 1/2 0.220 0.078 
1/4 0.734 0.042 
1/8 2.27 0.024 
1/16 4.53 0.017 
5000 365 1/2 0.220 0.039 
1/4 0.734 0.021 
1/8 2.27 0.012 
1/16 4.53 0.009 


Table 4 Results of applying equation (26) for a Gaussian distribution of 
shock loads to X = 150,000, = 1.0 


200 3.36 1/8 23.7 97 ,000 psi 
1/16 154 80,000 
1/32 447 72,000 
1/64 996 67 ,000 

1000 5.10 1/8 23.7 93 ,000 
1/16 154 7,000 
1/32 447 69,000 
1/64 996 

5000 9.00 1/8 23.7 88,000 
1/16 154 73,000 
1/32 447 65,000 
1/64 996 60,000 


References 


1 R. M. Mains, “An Application of Accumulative Damage 
Criteria,” 25th Shock and Vibration Bulletin, Part II, 1957. 

2 A. R. C. Markl, “Fatigue Tests of Piping Components,” 
Trans. ASME, vol. 74, 1952, pp. 287-303. 

3 E.E. Baldwin, G. J. Sokol, L. F. Coffin, ‘Cyclic Strain Fatigue 
Studies, on AISI Type 347 Stainless Steel," Proceedings, ASTM, vol. 
57, 1957, pp. 567-586. 

4 8. P. Timoshenko, “Vibration Problems in Engineering, " Van 
Nostrand and Company, New York, N. Y., 1937, p. 32 ff. 


5 M.A. Miner, “Cumulative Damage in Fatigue,”” Journal of 
Applied Mechanics, vol. 21, Trans. ASME, vol. 67, 1947, p. 159. 

6 H.T.Corten and T. J. Dolan, “Cumulative Fatigue Damage,” 
International Conference on Fatigue of Metals, session 3, paper 12, 
I. Mech. E. and ASME, 1956. 


7 A. M. Freudenthal, “Cumulative Damage Under Random 
Loading,” International Conference on Fatigue of Metals, session 3, 
paper 4, I. Mech. E. and ASME, 1956. 

8 E. Gassner, “Effect of Variable Load and Cumulative Damage 
on Fatigue in Vehicle and Airplane Structures,’’ International Con- 
ference on Fatigue of Metals, session 3, paper 10, I. Mech. E. and 
ASME, 1956. 

9 W.M. Wilson and A. B. Wilder, ‘‘Fatigue Tests of Butt Welds 
in Structural Steel Plates,” Bull. 310, University of Illinois Engineer- 
ing Experiment Station, 1938. 

10 H. J. Gough, “The Fatigue of Metals,’’ London, England 
1924. 
11 B.F. Langer, ‘Design Values for Thermal Stresses in Ductile 


Materials,”” Welding Journal Research Supplement, September 1958, 
pp. 411-417. 


Transactions of the ASME 


= 
i 
| 255,000 
174,000 
1/16 14.25 98 , 000 
@ 


Effect of Relaxation on the Behavior 
of Materials Under Combined 
Alternating and Static Stress 


The effect of temperature on the stress range diagram is discussed and the particular 
characteristics of the curves for unnotched and notched specimens are analyzed. Ex- 
cluding metallurgical factors from consideration, it is suggested that relaxation is the 
principal mechanism which influences the behavior of polycrystalline metals under 
combined alternating and static stress. Two parallel nonlinear Maxwell units are 
used to represent the relaxation mechanism at elevated temperature. An analysis of this 
model shows that relaxation occurs to an asymptotic finite value of stress which is a 
function of the initial stress. The same model is applied for representing the behavior 
in the stress range diagram with the assumption that a linear relationship exists at low 
temperature between the fatigue strength and the normal stress acting on the planes of 
reversed slip. 


F. H. VITOVEC 


: Associate Professor 

of Metallurgical Engineering, 
Department of Mining and Metallurgy, 
University of Wisconsin, Madison, Wis. 


Introduction 


HE RAPID DEVELOPMENT of flight vehicles in recent 
years has led to a demand for more intensified study of the 
fatigue behavior of materials at both high and low temperatures. 
Although most of the fatigue data obtained are commonly com- 
parisons of the performance of rather complex materials under 
some closely specified conditions, comparison of the data permits 
observation of some apparently basic patterns with regard to 
effect of temperature and stress concentration as expressed by the 


The effect of temperature on the behavior of heat resistant 
alloys under combined alternating and static stress ie sche- 
matically illustrated by the stress range diagrams for unnotched 
and notched specimens of Fig. 1. The curves represent the be- 
havior of materials in temperature ranges where metallurgical 
reactions are assumed to be negligible and for stresses which re- 
sult in a relatively long life for the specimens. Two types of 
stress range diagrams are shown for both unnotched and notched 
specimens. For the diagrams to the left in Fig. 1 the ordinate 
indicates the alternating stress, and the abscissa the mean stress. 


~e stress range diagram. Each curve gives the combination of alternating and mean 


Contributed by the Metals Engineering Division and presented 8ttess which results in failure after a definite number of hours or 
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Fig. 1 Schematic stress range diagrams for unnotched and notched specimens 
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each other. The three lines in each diagram represent the curves 
for three different test temperatures, where 7’, is a relatively low 
temperature, 7, is one intermediate temperature, and 7’; is as- 
sumed to be a high temperature which is close to the temperature 
range for recrystallization. 

The schematic diagram for the unnotched specimens shows a 
nearly linear relationship between alternating stress and mean 
stress at low temperature. As the temperature increases the re- 
lationship becomes nonlinear. At high temperature the curve 
shows two ranges, one in which the static mean stress has rela- 
tively little influence on the allowable alternating stress ampli- 
tude, which is followed by a range in which the strength is pri- 
marily determined by the rupture stress and the influence of the 
superimposed alternating stress is small. 

In the dimensionless stress range diagram for unnotched speci- 
mens the relationship between alternating and mean stress is also 
linear for low temperature. With increasing test temperature 
the curves lie increasingly above the straight line. 

The stress range diagram for notched specimens indicates three 
stages which become more pronounced at elevated temperature. 
Small superimposed mean stress decreases the allowable alter- 
nating stress amplitude significantly in the first stage. In the 
second stage a further increase of static mean stress from a cer- 
tain value of mean stress has relatively little effect on the al- 
lowable stress amplitude, until in the third stage at large mean 
stresy the stress amplitude decreases more or less rapidly to zero. 
The rate of decrease of the stress amplitude in the third stage in- 
creases with increasing temperature. This trend is more sig- 
nificantly apparent from the dimensionless stress range diagram. 
At small mean stress the curves lie significantly below the straight 
line. At intermediate alternating to mean stress ratios a more or 
less pronounced step can be observed which corresponds with the 
second stage and the curve changes to positions above the straight 
line. 

The described pattern was exhibited in more or less pronounced 
form by the alloys 8-816, Stellite 31, Waspalloy, Inconel X-550, 
16-25-6 Timken, GMR-235, SAE 4340, N-155 [1-5].! In many 
instances, especially for notched specimens, the test data com- 
monly procured were not sufficient to show the various charac- 
teristic regions of the stress range diagrams. 

In the following paragraphs various mechanisms are discussed 
which may explain the observed behavior, and a model is sug- 
gested which permits analysis of the stress range diagram at ele- 
vated temperature under consideration of creep. 


Analysis of the Stress Range Diagram 


The previous discussion indicated that the effect of mean stress 
on the alternating stress amplitude decreases with increasing tem- 
perature. This effect may be caused by (a) increasing recovery 
from fatigue damage with increasing temperature, (b) increasing 
retardation of slip due to strain rate or frequency effects, and (c) 
relaxation. Metallurgical reactions may strongly affect the be- 
havior; however, these are considered to be superimposed to the 
mechanisms mentioned previously and are therefore excluded 
from consideration in this paper. 

All three mechanisms, i.e., recovery, strain rate effects, and re- 
laxation, occur simultaneously and contribute to the observed 
behavior in different degrees. 

Several studies were concerned with recovery from fatigue 
damage during intermittent heating [6, 7]. According to these 
investigations recovery may occur only very early during the 
fatigue life. 

Strain rate effects introduced by alternating stress may cause 
retardation of dynamic creep and stress rupture. This retarda- 
tion increases with increasing test frequency and/or increasing 


1 Numbers in brackets designate References at end of paper. 
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temperature [8]. However, experimental studies and mathe- 
matical analysis indicate that this mechanism does not cause a 
large enough effect to explain the behavior as discussed in the in- 
troduction. 

Relaxation in unnotched specimens may be explained by the 
following mechanism. In the large cycle fatigue range of poly- 
crystalline specimens fatigue occurs in few localized regions, i.e., 
individual grains which are subjected to reversed shear. Re- 
versed shear produces “‘fine slip’’ and initiation of cracks which 
spread, join, and produce the fatigue fracture. These fatigue 
affected grains are embedded in the elastic matrix of the grains 
which do not participate in reversed slip. The stress range dia- 
grams of polycrystalline metals and tests with single crystals in- 
dicate that normal stress, i.e., tensile stress acting on the planes 
of reversed slip, reduces the fatigue strength. Stress range dia- 
grams for low temperature suggest a linear relationship between 
reversed shear stress and normal stress.?. If a mean stress is ap- 
plied at elevated temperature, relaxation from this normal stress 
may occur for the grains which are subjected to reversed slip, 
whereas the elastic matrix carries the static load without sig- 
nificant increase in deformation or change in creep behavior. The 
localized relaxation of normal stress, however, increases the life 
of the specimen or, in other words, permits higher stress ampli- 
tudes for the same nominal mean stress and the same specimen 
life. 

Localized relaxation explains also the three stages of the stress 
range diagram for notched specimens [9]. The apparent initial 
rapid decrease of allowable alternating stress in the first range is 
due to the fact that nominal stresses (load divided by minimum 
cross-sectional area of the notched specimen) are plotted instead 
of true stress (approximately, stress concentration factor times 
nominal stress). This procedure distorts the diagram and re- 
duces the scale as compared with the curve for unnotched speci- 
mens. A further increase in nominal mean stress beyond the 
corresponding yield strength causes primarily a propagation of the 
plastic zone and little or no increase in actual mean stress (even- 
tually caused by strain hardening). In other words, relaxation 
of normal stress occurs in the fatigue affected zone due to plastic 
deformation and the actual mean stress is nearly constant al- 
though the nominal mean stress is further increased. This ex- 
plains why in the second stage of the stress range diagram the 
alternating stress amplitude is apparently little or not at all af- 
fected by the mean stress. 

In the transition to the third stage a further decrease of al- 
ternating stress is caused by a change in fracture mechanism from 
fatigue to creep, and crest stress, i.c., the sum of alternating and 
mean stress, is the dominating factor. 

The effective stress concentration factor which is derived from 
the stress range data for unnotched and notched specimens 
usually increases at first with decreasing alternating to mean 
stress ratio. It reaches a maximum at intermediate ratios from 
which it decreases to values smaller than one. This trend results 
from the particular shape of the stress range curve for notched 
specimens which is produced by the mechanisms discussed in the 
previous paragraph. The maximum effective stress concentra- 
tion factor occurs at the transition from the first stage to the 
second stage of the curve which is determined by the initiation of 
macroscopic relaxation effects in the volume which is subjected 
to maximum strain. 


Model Analysis of Relaxation of Mean Stress During Fatigue 
Cycling 

With regard to the response to combined alternating and mean 
stress one may distinguish two phases in the test specimen. One 
of them represents the grains which are subjected to fatigue and 


2 For a discussion of this relationship see Refs. [11] and [12]. 
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which extend only over a small fraction of the entire volume, the 
other represents the matrix which is, at elevated temperature, 
primarily subjected to creep under constant stress. These two 
phases may be represented by two nonlinear Maxwell units in 
parallel, which are coupled so that their total strain is the same. 
This condition is expressed by the equation 


+ = C2 + Ca 


(1) 


where ¢ is the elastic strain and e, is the creep strain. Index 1 in- 
dicates the unit which represents the fatigue affected grains and 
index 2 indicates the unit which represents the matrix. Differ- 
entiating Equation (1) and assuming that the contribution of 
phase 1 to the load carrying capacity of the specimen is negligible 
results in the equation for the strain rates 


So. 1 
Ber + (K/e}* 
* — (K/c) 
for 


V1 + 


801 


t= 
Ee. + (K/c)® 


— (K/c) 
+ (K/e)? 


— Ke) — V1 + + (K/c)* + — K/c)] 


or, in another form, if the strain rate is expressed as a function of 
stress é, = f(s) the differential equation for this model is 


1 d 
— f(a) = E 


(2) 
where E is Young’s modulus. Expressing the creep function by 
the hyperbolic sine relationship é, = c sinh (s/s») and integrating 
Equation (2) gives the time ¢ at which the stress reduces from the 
initial stress s; to the stress s 


(3) 
— sinh (8/8) 


where K = f(8:) = c: sinh (8;/8) = const because no relaxation 
occurs in phase 2, Integration of Equation (3) results in 


+ (K/e)* + — K/e)] [V1 + (K/e)* — (e”™ — K/c)] 
+ (K/c)* — (e/™ — K/c)| + -(K/c)? +(e" —K /e)| 


(4a) 


[Vi 4 + + (K/e)* + (e* ‘om — K /c)| —K — Vi (K/c)*] 


(4b) 


of psi 


Asymptotic 


Stress 


Stress — Thousands 


0.5 


1.0 1.5 


Time — Hours 


Fig. 2 Relaxation curve for the alloy $-816 at 1350 F and an initial stress of 30,000 psi 
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Equations (4) were evaluated using data for the alloy S-816 
(3). The constants c, and se were determined from the creep 
curves using Nadai’s chart [10]. Furthermore, it was assumed 
that the difference in creep behavior between the two phases is 
characterized by the difference between c: and c, and that 82. = 
8. The numerial values of the constants is shown in Table 1. 
The method of determining ¢, is discussed later. 


Table 1 Constants for the alloy S-816 


Test 

temperature, 2, 802, C1, 
deg F in/in/br psi in/in/hr 
1350 2.90 X 10-* 5550 8.00 x 1075 
1500 1.08 10-6 3300 X Wwe 
1650 4.00 x 1078 1560 3.60 xX 10-5 


Fig. 2 shows a relaxation curve which was determined from 
Equation (4) for 8-816 at 1350 F, assuming an initial stress of 
30,000 psi. The diagram indicates a rapid relaxation of stress 
to an asymptotic value s, = 11,675 psi in about 2 hours. This 
asymptotic stress is determined by a critical value of exp(s/so) in 
Equation (4) at which the denominator becomes zero. It in- 
creases more or less linearly with increasing initial stress above 
a certain value of initial stress as shown by the diagrams of Fig. 3. 
This value of initial stress appears to decrease with increasing 
temperature, and the rate of increase of asymptotic stress appears 
to be about the same at all three temperatures. The initial rapid 
decrease of stress and the occurrence of an asymptotic stress is 
significant for consideration of fatigue damage. 


Application of the Relaxation Model to the Stress Range 
Diagram 

The application of this concept for describing the strength be- 
havior under nonlinear alternating and static stress may be il- 
lustrated by use of functional relationships. In a first approxima- 
tion it is assumed that the relationship between alternating stress 


s, and mean stress 8,, also is linear at elevated temperature if no 
relaxation would occur, i.e., 


444 / 1960 


Fig. 3 Relationship between initial stress and asymptotic stress for the alloy S-816 at 
different temperatures |v) corresponds to c, in Equation (4)| 
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Fig. 4 Dimensionless stress range diagram for unnotched specimens of 
the alloy S-816 comparing the theoretical curve with experimental 
data 


8, + ks,, = const (5a) 


The nominal mean stress is identical with the initial stress s;. 
Because of relaxation the effective mean stress s,,.:; acting on the 
fatigue affected grains is a function of nominal mean stress or 
initial stress, and decreases with time ¢ 


Smett = t). (5b) 
With the relaxation model given this relationship is determined 
by Equations (4). Since the initial rate of relaxation is very high 


it may be permitted to assume that fatigue damage corresponds 
to that produced by a stress equivalent to the relaxed or asymp- 
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totic stress. Combination of Equations (5a) and (5b) result in 
= const — k-f(s,,, t). (5c) 


Test data for 8-816 alloy were evaluated using Equations (4) ac- 
cording to the relationship (5c). The constant c as listed in Table 
1 is determined by fitting the theoretical curve through the ex- 
perimental points. 

The theoretical curves are shown in the stress range diagram of 
Fig. 4 together with the test points. Up to a ratio of mean stress 
versus creep rupture stress of 1.0 both curves can be brought to 
close agreement. The theoretical curve is limited by a vertical 
line at the ratio 1.0which represents creep rupture. Since the model 
was derived for the effect of mean stress on fatigue strength only 
it can not explain the details in behavior in the creep rupture re- 
gion. The experimental curves indicate in this region that adding 
a small alternating stress allows a higher mean stress for the same 
life than static load alone. Considerable evidence points toward 
retardation of creep rupture caused by strain rate effects which 
are introduced by alternating stress. A similar retardation effect 
is observed for dynamic creep and has been analyzed using this 
concept in a previous publication [8]. A detailed analysis of 
this effect is beyond the scope of this paper. 


Stress Relaxation During Fatigue in Notched Specimens at 
Elevated Temperature 


The concept discussed in the previous paragraph can also be 
applied to notched specimens for stresses within the first stage of 
the stress range diagram. In this range fatigue occurs only in 
few grains along the circumference of the root of the notch. At 
the initiation of the second stage, however, a relatively large 
volume is subjected to relaxation effects. There are not enough 
data available at this time for the first and second stage of the 
stress range diagram of notched specimens which would permit a 
theoretical analysis of the mechanism in these ranges. 


Summary and Conclusions 


If a material is subjected to combined alternating and static 
stresses relaxation of static mean stress for the grains which are 
subjected to fatigue may occur primarily at elevated tempera- 
tures. This mechanism can be represented by two parallel non- 
linear Maxwell units. The model can describe the observed be- 
havior quantitatively and permits the computation of the fatigue 
stress range diagram. The model analysis indicates that relaxa- 
tion from mean stress occurs asymptotic to a finite stress which is 
determined by the initial stress. Other mechanisms such as re- 
covery or strain rate effects which may also affect the behavior 
are considered to be of secondary importance as compared with 
the effect of relaxation. 

Relaxation effects produce three stages in the stress range dia- 
grams of notched specimens. At present these stages can be 
analyzed only qualitatively and more experimental information 
is necessary for a quantitative theoretical analysis. 
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DISCUSSION 
M. J. Manjoine® 


The author points out that at low mean-to-alternating stress 
ratios the strain rate effects are small compared to the relaxation 
effects for materials under combined alternating and static stress. 
At high mean-to-alternating stress ratios, however, net flow 
occurs and strain-rate effects must be considered. The writer‘ 
has postulated the rate of loading from the minimum to maximum 
stress (alternating-stress range) and the maximum stress governs 
the incremental flow which adds to the creep strain resulting from 
the minimum stress. The effective stress for flow and rupture, 
therefore, lies between the minimum and mean stress; this results 
in a higher apparent creep-rupture strength at high mean-to-al- 
ternating stress ratios. 


Volker Weiss 


We are currently conducting thermal-cycling fatigue tests under 
a research contract with the Army Ballistic Missile Agency. 
The temperature range of thermal cycling is between —320 F and 
+600 F. Various light metal alloys such as aluminum and mag- 
nesium base alloys are being studied. The specimen is subjected 
to a constant load and immersed in liquid nitrogen. Thermal 
cycling is achieved by periodically passing a high current (up to 
300 A) through the specimen. Thus maximum temperature, 
the heating rate, and the duration of the cooling cycle could be 
varied. The total number of cycles which a specimen withstood 
for a given set of conditions was recorded as a function of applied 
nominal stress. Time versus elongation and temperature versus 
elongation curves are recorded for each test. 

Fig. 5 is a photograph of the test stand. The grips of the stress- 
rupture machine were modified to allow immersing the specimen 
in liquid nitrogen. The three instruments are (a) a Brown con- 
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troller which controls the maximum temperature of each cycle, 
(b) a speedomax recorder which plots the variation in specimen 
length against time, and (c) a speedomax z-y recorder which plots 
instantaneous temperature against length of the specimen. 

Fig. 6 shows the effect of applied nominal stress, maximum 
cycling temperature, and heating rate on the number of thermal 
cycles withstood to failure of the ALCOA aluminum alloy X-2020. 
The yield strength values for the respective maximum cycle 
temperatures after '/. hour at temperature, as reported by 
ALCOA, are also listed for comparison. 


Specimens were tested 
under moderate and high heating rate, i.e., approximately 100 
F/sec and 1000 F/sec, respectively. It can be seen from Fig. 6 
that the maximum cycle temperature appears to be the domi- 
nant variable controlling the mechanical behavior of the mate- 
rial under investigation. 


From the few 
experimental results shown here, however, it is evident that both 
maximum cycling temperature and, to a lesser degree, heating 
rate have an effect on the thermal-cycling fatigue behavior of the 
material investigated. 


Analysis of our data is by no means completed. 


Both thermal stresses as well as creep 
seem to be active in promoting failure. 
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Mr. Manjoine draws attention to the creep retarding effect 
caused by strain rate which is introduced by the superimposed 
alternating stress. From this behavior it may be inferred that 
rupture may also be retarded. The effect of strain rate becomes 
particularly apparent at high alternating-to-mean stress ratios. 
It may aid in the explanation for the transition from fatigue to 
stress rupture conditions as indicated in Fig. 4. However, as 
noted in the paper (page 442) the significant retardation of rup- 
ture which is observed in many instances [8] cannot be entirely 
explained by strain-rate effects. 

The interesting study of the effect of mean stress on thermal 
fatigue under different heating rates reported by Mr. Weiss in- 
dicates that probably localized relaxation may influence the 
strength behavior under the particular test conditions. 
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Cyclic Operation of Pressure 


K. R. MERCKX 


Hanford Atomic Products 
Operation, Richland, Wash. 


Piping With -y Heating 


An elastic-plastic analysis is developed for an internally cooled pressure tube with unt- 
form heat generation. 


This analysis extends the method of calculating the location of the 


elastic-plastic boundary reported by Barrie |4\' to account for the change in the plastic 


zone due to residual stresses which occur during cyclic operation. 


Numerical calcula- 


tions are made for operating conditions expected to be encountered in a pressure tube in 


a loop through the Engineering Test Reactor Core. 


The numerical results show that the 


radius of the initial plastic boundary decreases during subsequent loading cycles. Also, 
for equal maximum pressure and volumetric heat generation, the total plastic strain per 
operational cycle on the inner tube surface and the residual tensile stress on the outer tube 
surface increase when the tube wall is thickened. 


Introduction 


= PRESSURE PIPING can undergo gamma 
heating which causes mechanical effects comparable to those 
created by the pressure loading. Such a pressure tube is installed 
in a core facility of the Engineering Test Reactor at Arco, Idaho. 
The gamma heating in this facility is severe enough to cause 
plastic yielding for any tube wall thickness sufficient to with- 
stand the internal pressure. Under such operating conditions, 
design analyses should consider the plastic yielding due to cyclic 
loading as well as the long term effects of steady-state operation. 
Hence, the following approximate elastic-plastic solution is de- 
veloped in order to evaluate the changes in cyclic plastic strains 
which occur for various pipe thicknesses under the cyclic loads of 
an internal pressure and a uniform volumetric heat generation. 


Methods of Analysis 

The analysis of an internally cooled and pressurized pipe with 
uniform heat generation described in this report is made in order 
to estimate the elastic-plastic boundaries which occur after 
residual stresses are established by cycling both the internal pres- 
sure and amount of heat generation. The “heating cycle’’ is con- 


! Numbers in brackets designate References at end of paper. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29-May 3, 
1959, of Tue American Society oF MecuanicaL ENGINEERS. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, January 
30, 1959. Paper No. 50—Met-4. 


sidered to be full internal pressure and gamma heat generation 
and the “cooling cycle’’ is considered to be zero internal pressure 
and heat generation. The maximum internal pressure in the pipe 
is P; the pipe is internally cooled; and the maximum rate of 
volumetric heat generation is g. The following assumptions have 
been made regarding the behavior of the material: 


1 Perfect plastic-elastic behavior 

2 Equal yield strengths in tension and compression 

3 No Bauschinger effects [1] 

4 The material properties are independent of temperature 
5 Maximum stress is the yield criterion (0, = yield stress) 


In this problem, the maximum stress amplitude in the region of 
vielding is the circumferential stress, therefore, ¢, = +0, in the 
region of plastic yielding. 

In order to facilitate the ca)culations used in this analysis, the 
following elastic solutions are used to determine the stress dis- 
tributions in the elastic regions of the pressure pipe or thick-walled 
cylinder. 

1! The ratio of the circumferential stress [2] to the yield stress 
8, = @,/¢,, for an infinite length internally cooled cylinder with 
uniform volumetric heat generation qg and zero surface traction at 
radii au,,, and aug is 


B 


Up? 
u,;) = (u? + In 
Qu? — Uys? 


— (%? )u? In 


u 


2 
u 
| + Su Ug*U 


(1) 


internal heat generation /vol- 
ume 


tube thickness 


(total circumferential stress ) / 


pressure 
elastic modulus 
coefficient of linear thermal 
expansion 
thermal conductivity 
Poisson's ratio 
yield stress 
Eaqa* 
8K(1 — v)a,’ 
rameter containing ma- 
terial properties, heat 
generation, and size 
a inner tube radius 


Journal of Basic Engineering 


problem pa- 


temperature drop across the 
tube 

P/o,, pressure parameter 

(radial distance)/(inner ra- 
dius ) 

(outer radius) /(inner radius) 

(radial distance to plastic 
boundary )/(inner radius); 
for cycle i 

circumferential stress 

radial stress 

subscript signifying cycle; 
( )o initial heating; ( ); 
first cooling; etc. 


(yield stress); for cycle 


o,/o, for the stress in a 
cylinder with inner pres- 
sure P and outer radial 
stress o, if the circum- 
ferential stress = ¢, 

8r(U, U,,) g,/o, for location u in a 

cylinder with inner bound- 


ary u,, for thermal stresses 


Lad 


sp(u,u,,) = o,/o, for location u in a 
cylinder with inner bound- 
ary u,,; for radial stress 
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it The ratio of the circumferential stress [3] to the yield stress 
8» = o,/¢, for an infinite length thick-walled cylinder with a sur- 
face traction of s, = ¢,/o, at radius au,, and zero surface traction 
at radius au is 


(5% —) + (2) 


Both of these equations are for conditions of plain strain where 
the resultant axial force can be adjusted to any prescribed value 
with the addition of a uniform axial stress. The equations are 
written in dimensionless form suitable for numerical evaluation. 
The highest stress magnitude in the pipe during the pressuriza- 
tion and heating are the circumferential stresses on and near the 
inner boundary of the tube; thus, the maximum stresses, the 
circumferential stresses, reach the yield stress o, in the plastic 
region | < u < up; (¢ = 0, 2,....) during the “heating cycle.”’ 
Since the maximum yielding occurs along the inner surface of the 
tube, the circumferential residual stresses will be of the highest 
absolute magnitude along the inner surface of the tube. If the 
pressure and thermal incompatibilities are of sufficient magnitude, 
their removal will cause yielding along the inner regions of the 
tube 1 <u < u,; (i = 1,3,....) and the circumferential stress 
will be o, = —a,. Therefore, in the plastic portion of the tube, 
the ratio of the circumferential to yield strength s, is 
s(u) =(—-1)',, lous uy (3) 
The calculations which follow are made by assuming that the 
change in circumferential stresses within the plastic region can be 
determined with a knowledge of the previous stress state and the 
new circumferential stress value. If the change in the circum- 
ferential stresses are known then, by equilibrium considerations 
within the plastic region, the change in radial stress can be de- 
termined. The change in radial stress at the elastic-plastic bound- 
ary is calculated and assumed to be the surface traction on the 
elastic portion. The changes in the circumferential stresses in 
the elastic region then occur due to the change of the temperature 
within the elastic region and the change of inner surface traction. 
These changes are either subtracted or added to the original stress 
system so that the yield conditions are satisfied in the plastic 
region, thermal effects are accounted for, and the correct pressure 
exists on the inner tube surface. The location of the elastic-plastic 
boundary is found by equating the change in the circumferential 
stress in the elastic region to that of the plastic region at the elas- 
tic-plastic boundary. The detailed explanation of this method of 
determining the location of the plastic boundaries u,; and the 
total circumferential stresses throughout the tube follows: 


Initial Heating and Pressurization 


For a pipe in a stress free state, the initial application of pres- 
sure and y-heating (if of sufficient magnitude) will cause plastic 
yielding on the inner boundary. The circumferential stress dis- 
tribution is of the form shown in Fig. 1. The force equilibrium 
equation in y-direction for the plastic portion of the quadrant of 
pipe shown in Fig. 1 is 


d 1 
aP +co, = — or 8,(u,,) = —— + (1 ) (4) 
Uno Uno 


where 


P 
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The maximum stress criterion specifies the circumferential stress 
in the plastic region, therefore, allowing equation (4) to be evalu- 
ated using only equilibrium considerations. The value of s,(u,,), 
equation (4), is needed to determine the portion of the circum- 
ferential stresses in the elastic region due to the surface traction 
transmitted across the elastic-plastic boundary. The problem in 
the elastic region now resolves into an internally cooled cylinder 
with an inner radius au,,, uniform internal heat generation q, 
and the surface traction. Therefore, the total circumferential 
stress in the elastic region is Obtained by superposition of the 
elastic solutions, equations (1) and (2). At the elastic-plastic 
boundary, the total circumferential stress equals the yield stress; 
hence, 


= Upo) + Sp(Upo, Upo) (5) 


where equations (1), (2), and (4) are used to evaluate 8,7, 8p, 8, 
with u = u,, and u,; = u,,. Equation (5) is satisfied only if the 
correct value of u,, is assumed; therefore, equation (5) is solved 
by trial and error for u,,, the first or initial plastic boundary. The 
resulting stress distribution throughout the cylinder or tube can 
then be evaluated: 

l<u<u 


— “po 


= 8p(U, Up.) + Sp(U, Up SUSU (6) 


This method of evaluation was previously used by Barrie [4]. 
The method of estimating the subsequent stress distributions 
during cyclic operation is the author’s extension of Barrie’s 
method. 


Cooling Cycle 


When the pressure and heat generation are removed, the re- 
sidual circumferential stress is found by subtracting the elastic 
solutions 


8p(u, u,;) and 87(u, u,;) from (¢ = 1, 3, 5,.. .) 


in all of the cylinder which behaves elastically. If conditions are 
severe enough, plastic yielding in compression occurs on the inner 
portion of the cylinder. The radius of the elastic-plastic bound- 
ary during cooling will be less than that of the preceeding heating 
cycle; hence, throughout the new plastic region and on its 
boundary, the decrease in the circumferential stress is twice the 
yield stress. The new value of s, used in the calculations for the 
stress decrease within the elastic region is determined with the 
same equilibrium considerations as were previously used in ob- 
taining equation (4), except a, is replaced by 2¢,. The result of 
this analysis is 


d 1 
= +2(1 - — ). (7) 
pl Apr 


The new elastic-plastic boundary is determined in a manner 
similar to that described for equation (5), except the decrease of 


y 
Total Pipe Plastic Portion 
f 
a Elastic Region a 
Plastic Region 
P 
\ 
>* > * 
| 


Fig. 1 Circumferential stress distribution 
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where 
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Fr’ 
i 
| 
4j 
- 
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the total circumferential stress in the elastic region at the new 
elastic-plastic boundary equals twice the yield stress: 


2= 87( Upi,Upi) + (8) 


Equation (8) is solved by trial and error for the new elastic-plastic 
boundary up,. The u,,; (¢ = 1, 3, 5,...) remains constant since 
all radii for the cooling plastic boundaries are less than those for 
the heating plastic boundaries in this analysis; hence, equations 
(7) and (8) do not change for the different cooling cycles. The 
ratio of the total circumferential stress to yield stress for any cool- 
ing cycle is: 


s(u) = —1. 8 


= — 8p(u, Um) — Up)... Up Su <u (9) 


where i = 1, 3, 5, .. . Equations (1) and (2) are used to evaluate 
8p and 8, while s,-, is obtained from the previous heating cycle 


and 8,(u,;) is given by equation (7). 


Subsequent Heating Cycles 

The plastic yielding occurring during previous cycles of opera- 
tion causes residual stress systems which affect the amount of 
increase in circumferential stress necessary for plastic yielding. 
The radius of the cooling plastic boundary is less than the radius 
of the subsequent heating plastic boundary; therefore, the in- 
crease of the circumferential stress necessary to cause plastic 
yielding at the elastic-plastic boundary is less than twice the yield 
stress and is 


1 — = + Up)... (i = 2,4,... 


where 8,-,(u,;) > —1 
In order to determine the radial stress used for the surface trac- 
tion in equation (2) for sp at the elastic-plastic boundary, only the 
increase in circumferential stress in the new plastic region is used 
in the equilibrium argument. Thus the shaded area in Fig. 2 is 
the force increase due to the change in circumferential stress in the 
plastic region. Using the same equilibrium considerations as 
previously used to obtain equation (4), 
a(u,; — U,;) 


aP + au,,o, ~ a(u,; — 1)20, — (o, + o%,_,) 


d (Up; — Ups) 
= — (1 + 8-1). (10) 


2 


u 


The ratio of total circumferential stress to yield stress for any 
heating cycle other than the initial is 


lsucsu,, 


= 8i-(u) + 87(u, + Sp(u, Uy)... up Susu, (11) 


where i = 2,4,6.... The plastic boundary is determined by 


aUpi 


| 
Radia! Distance 


Yj 
A 
' Previous Cooling Cycle 


Fig. 2 Change of circumferential stress in the plastic region during 
heating 


Journal of Basic Engineering 


substituting a series of increasing u,,; into the second part of equa- 
tion (11). The first trial u,, that satisfies s,(u) < 1 for all u ? uy, 
is the correct u,; Equations (1), (2), and (10) are used to 
evaluate 87, 8p, and 8,. 


Cases for Evaluation 


The following cases are evaluated in order to determine the ef- 
fects of various pipe thicknesses and yield stresses on the total 
plastic yielding per cycle. The calculations are made for design 
conditions of a pressure pipe in the Engineering Test Reactor 6 X 9 
core facility. The material properties are for 347 stainless steel 
in the annealed condition and at 600 F. The following properties 
(5) are used in the numerical calculations: 


Design pressure, P = 2,500 psi 

Assumed ¥, heating, g = 30 w/gm = 3,945 w/in.* 

Inner tube radius, a = 1.719 in. 

Material, 347 s.s., properties 600 F 

Poisson’s ratio, » = 0.28 

Elastic modulus, E = 21 X 10° psi 

Yield strength, ¢, = 32,000 psi 

Thermal conductivity, K = 11.2 Btu/hr-F-ft = 0.273 w/in-F 
Coefficient of thermal expansion, a = 9.8 X 10~ (in./in.)/F 


A thickness was calculated [6] which minimized the total cir- 
cumferential stress for an elastic material. This calculation 
yielded a thickness { = 0.114 in., a thermal circumferential stress 
component (¢,)7 = 18,600 psi, and a circumferential stress due 
to pressure (¢,)p = 39,100 psi. Since the sum of these circum- 
ferential stresses is above the yield stress of annealed 347 stainless 
steel, and the pressure component of stress was above any allowa- 
ble working stress, the design on an elastic basis is unrealistic. 
Therefore, the plastic-elastic calculations using the method de- 
scribed in this report are used to estimate actual conditions in 
thicker pipe sections. Trial thicknesses are selected for reasona- 
ble working stresses ¢,, calculated on the basis of only consider- 
ing pressure effect. Thicknesses corresponding to working stresses 
are calculated [7] with the equation 


Pa 
o,, — 0.6P 


The following values of working stress are assumed: 
o,, = 22,000; 18,000; 15,000 psi 


Using the three values of working stresses and two values of yield 
stress of 32,000 and 25,000 psi, the following six cases were 
evaluated by the elastic-plastic solution described in this report: 


Eaqa* 


Uo d = p/ay 8A(1 — 
.122 0.0781 7.68 
. 152 0.0781 68 
185 0.0781 7.68 
122 0.1 1.03 
152 0.1 
. 185 0.1 03 


The temperature drops across the tube AT and inner surface 
heat fluxes are calculated with the following equations: 


2 
AT = = (ug? In we? — ue* + 1) 


Flux = — 1)g 


The values of these quantities for the various working stresses are 
given in the following table: 
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Working 
Thickness, Heat flux, 
in. Uo F Btu /hr-ft? 
0.210 432,000 
0.261 545,000 
0.318 674,000 


Results of Calculations 


The numerical results obtained for the total circumferential 
stresses are graphed in Fig. 3, Case I to Case VI. The plastic 
boundary becomes larger as the thickness increases or as the yield 
strength decreases. Subsequent changes in the stress distribu- 
tions for either the cooling or heating cycles are insignificant, but 
the radius of the plastic boundary for the first heating cycle is 
significantly reduced during subsequent heating cycles. 

As the pipe thickness is increased or the yield strength is de- 
creased, the residual tensile stress during the cooling cycle on the 
outer surface of the tube is increased. This stress almost reaches 
the yield stress for Case VI, which is the example with the thickest 
pipe wall and the lowest yield stress. 

Values for the plastic straining on the inner surface of the tube 
can be estimated by assuming: 


(a) No volume change occurs in the plastic region due to the 
stresses ; 

(6) Axial strain is constant and is known; 

(c) The axial and radial stresses at the plastic-elastic bound- 
ary do not appreciably alter the circumferential strain. 


Thus, the displacement on the plastic-elastic boundary, 5(u,,), is: 


Accounting for the total change in volume in the plastic region 
due to the thermal gradients, the following relation for the strain 
at the inner boundary is obtained: 


6 ) 3 
Kup) + (u,? — 1) aT(u®)du? 
1 


1) 
€{1) = pi a 


The combination of these two equations gives the following esti- 
mate of the circumferential plastic strain: 


€, = — @T(1) — = 1) + 


+ au,? T(u,,) — aT(1) aT(u®)du*®, (12) 
1 


For the internally cooled tube with 7(1) taken as a reference 
temperature, a = constant, and €, = 0, equation (12) becomes 


2 2 2 
Uo? In 


€, = — 1) +(1—v)B (3 uo? — 


The following table contains values for the plastic boundaries 
and the total plastic strain per cycle calculated with this equa- 
tion: 


Cooling 


Heating Plastic strain 
boundary 


boundary per cycle, 

per cent 
1.0189 0.182 
1.0369 0.449 
1.0589 0.859 
1.0322 0.301 
1.0498 0.579 
1.0721 1.014 
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DISCUSSION 
Irwin Berman? 


The author is to be commended for his presentation of a 
practical example in which it is necessary to determine the 
elastic-plastic behavior of an infinite length thick-walled cylinder 
subjected to cyclic loading. The problem is solved by the use 
of the maximum normal-stress yield criterion. As mentioned 
by the author in his presentation, however, the use of the maxi- 
mum shear-stress yield condition with its associated flow rule 
may lead to a better solution. 

The maximum shear-stress yield condition with its associated 
flow rule has previously been employed in similar problems. 
Prager and Hodge [8]* and Hodge [9] discussed the thick cylinder 
in terms of shakedown under repeated internal pressure loading 
and unloading. Bland [10] presented a more extensive solution 
in which he considered, “elastoplastic thick-walled tubes of 
work-hardening materials internal and external 
pressures and to temperature gradients.’’ The solution for a 
perfectly plastic material was also discussed by Bland [10] as a 
special case. 

The assumption by the author of a perfectly plastic material 
may be reasonable even for a work-hardening material such as the 
347 stainless steel which was chosen for evaluation. This is so 
because the approximate plastic-strain-per-cycle values as cal- 
culated by the author, for the fairly small ratios of the outer to 
the inner diameter of the tube, are low. Any extension of this 
paper to cylinders in which there may be strains of a few per cent 
would necessitate the consideration of strain-hardening. 

This discusser would like to see some graphs of temperature 
plotted against thickness for the various conditions so as to be 
better able to evaluate the limitations of the author's assumption 
that the material properties are independent of temperature. 

The discusser believes there may be an error in the values of the 
elastic modulus and Poisson’s ratio which the author used for 347 
stainless steel in the annealed condition at 600 F. The values 
of these material properties are not evident in the author’s 
reference 
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D. R. Miller‘ 


The problem which the author has analyzed illustrates quan- 
titatively a concept which has been widely understood qualita- 
tively. This concept is that large thermal strains in a ductile 
metal produce plastic flow rather than extremely high stresses. 
It is now also widely recognized that plastic thermal strains can 
produce fatigue failures when the temperatures are cycled a 
sufficient number of times. 

It should be noted that the author has not followed Coffin [11 |® 
in defining the plastic-strain range as the amount of tensile or 
compressive plastic strain during a complete cycle of loading, 
but rather shows in his table the arithmetic sum of the tensile 
and compressive plastic strains during a cycle, computed by 
using equation (12). 

An interesting observation that may be made in studying the 
present problem is that the maximum plastic strain which occurs 
ov loading or unloading after the initial application of load is not 
simply the difference between the maximum strain computed 
on an elastic basis and the actual elastic-strain range from com- 
pressive yielding to tensile yielding. The simple situation, 
which is characteristic of a temperature-cycled bar with con- 
strained ends, is modified in the tube, because in the latter the 
occurrence of plastic flow over a portion of the wall thickness 
produces an expansion or contraction of the tube diameter in a 
direction which increases the magnitude of plastic strain. These 
changes in diameter caused a 13 per cent increase in the amount 
of plastic flow in the writer’s approximate analysis of circum- 
ferential strains in a thin-walled tube having thermal and mem- 
brane stresses similar to those of the author’s Case IV. Thus it 
is evident that elastic-stress analysis alone is not a sufficient 
basis for evaluation of low-cycle fatigue problems in cases where 
significant strains can be produced in elastically stressed regions 
because of plastic flow which occurs elsewhere. 

The author has assumed that the thermal and pressure loadings 
are cycled simultaneously. This assumption is not realistic for 
an in-pile pressure tube, since the pressure loading will be applied 
prior to start-up of the reactor and will be removed after shutdown 
of the reactor. Thus the thermal loading is cycled while the 
pressure load is maintained. A recent paper by the writer [12] 
showed that this condition would produce a progressive growth 
in diameter of the tube if the loadings were sufficiently severe. 
The criteria derived by the writer indicate that progressive growth 
would occur in all of the author’s cases if the heating were 
applied and removed repetitively while pressure loading was 
maintained. 

After establishing that adequate dimensional stability is 
assured in a particular in-pile tube design, a complete evaluation 
would include an analysis of longitudinal stress and strain dis- 
tributions. The results of the analysis of circumferential and 
longitudinal stresses and strains then would be combined to 
determine the triaxial stress-strain history in the region of 
maximum plastic strain. If suitable criteria were available, 
these conditions of low-cycle fatigue under triaxial stresses could 
be correlated with low-cycle fatigue data on uniaxially stressed 
specimens. Consideration is being given by various groups to 
experimental studies directed at correlations between uniaxial 
and biaxial stress low-cycle fatigue data, but little, if any, cor- 
related data are available. Lacking such correlation, it may be 
reasonable to assume that a specified range of radial strain in the 
tube is approximately equivalent to the same longitudinal 
strain range in a uniaxially stressed low-cycle fatigue specimen 
of the same material. This assumption attributes primary 
significance to similarity of triaxial strains in the tube and in 
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the fatigue specimen and ignores the significance of the dif- 
ferences in the stresses. 

The author has pointed out the need for consideration of the 
long term effects of steady-state operation. The most important 
factor to be considered will probably be the effect of neutron 
bombardment, which may increase the yield strength of a ductile 
metal and reduce its ductility. These effects must be considered 
with respect to low-cycle fatigue and, in some materials, must be 
considered with respect to the possibility of elevation of the 
brittle-fracture-transition temperature to an unsafe level. 
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Author’s Closure 


As I. Berman points out, the solution presented in this paper 
does not use the usual shear stress or Mises yield criterion. Had 
the author been familiar with Bland’s {10] work, the shear stress 
criteria would have been applied before programming the numeri- 
cal calculations. After the numerical results were obtained, 
the author had derived equations which were based on the shear 
stress yield criterion and these equations would not have pre- 
sented additional programming difficulties. In the relatively 
thin-walled tube analyzed in this paper, the variation in numeri- 
cal results based on the maximum shear stress yield criterion 
or maximum stress criterion would not be significant. D. R. 
Miller’s comments on critical evaluation of calculated stress and 
strain conditions in terms of physical behavior when mechanical 
behavior is not completely understood justify the use of approxi- 
mate numerical analyses. The purpose of this paper was not to 
present an elegant method of analytic analysis, but was to illus- 
trate that elastic analyses are not sufficient for calculating the 
plastic strain per cycle in systems undergoing plastic strain 
cycling. D.R. Miller’s suggestion that thermal-stress ratcheting 
should also be checked shows that design of structures under 
severe conditions will require a complete understanding of pos- 
sible failure mechanisms. The analytical work of the industrial 
designer is rapidly becoming more complex as service conditions 
are increased. Both more understanding of material behavior 
and new calculational methods are required; at present, many 
practical problems defy analysis because the material behavior 
cannot be given a realistic representation in terms of the variables 
which determine their life under actual service conditions. 

The assumption of material properties which are independent 
of temperature is not valid for the thick-walled cylinders with 
high y-heating. Temperature drops across the tubes are given 
in the paper, and, as is listed, the tube with 0.318-in. thick wall 
would have a 774-F temperature variation. Considering pos- 
sible film drops and a 550-F coolant temperature an outer tube 
wall temperature of 1400 F would be reached. At this tempera- 
ture, elastic analysis ceases to be valid and creep would occur. 
Thus, the design of thicker tubes does not only cause higher plas- 
tic strains based on an elastic-plastic analysis, but the increased 
temperatures cause the creep phenomenon to be significant. 
Thus, in the actual selection of a tube design, the analysis present 
in this paper is just one step in the selection of an adequate tube. 
A complete analysis would require a creep model which would 
allow for temperature, irradiation, and load cycling. Since 
such data are not available, only possible mechanisms of behavior 
can be estimated by models such as presented in this paper and 
D. R. Miller’s paper [12]. Knowing the inadequacy of the 
model for material behavior makes numerical values only of 
relative significance. 
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Creep Rupture Tests for Design of 
High-Pressure Steam Equipment 


Creep rupture tests on tubular specimens of type 316 stainless steel were run at 1200 F 
and at pressures up to 24,000 psi. The specimens were tested under pure internal 


pressure and equal biaxial tensions. 


The results of these tests correlate favorably with 


those of uniaxial tension tests if a comparison is made on the basis of effective stress and 


effective strain rate. 


Introduction 


HE DESIGN OF CERTAIN steam-turbine components 
for advanced high-pressure, high-temperature installations such 
as the Philadelphia Electric Company's Eddystone No. 1 Unit 
and the Avon No. 8 Unit of the Cleveland Electric Illuminating 
Company, demand new investigations into the combined stress 
behavior of materials to help assure the validity of the design 
concepts. At supercritical pressures the radial component of 
stress is of significant magnitude to become a dominant factor in 
the design of either cylindrical or spherical pressure vessels. 
With the present day state of design knowiedge, it is perplexing 
to decide what the allowable design stress should be when using 
triaxial combined stress formulas where all three stress compo- 
nents are of significant values. 

To help fill the void of actual test data on combined stress be- 
havior, the author’s Company instituted a rather unique testing 
program in which tubular specimens of type 316 stainless steel 
were subjected to triaxial stresses of various component ratios. 
Both pure internal pressure and equal biaxial tension tests were 
run. The test temperature was 1200 F, and pressures up to 24000 
psi were used in making the tests. The ratios of the stress com- 
These 
ratios represent the average values of the stress components. 

This paper suggests a possible means of correlating triaxial test 
data with data obtained from conventional uniaxial creep tests. 
It is hoped that a rather thorough analysis of a few complicated 
tests will shed sufficient light on the triaxial stress creep problem 
to permit design standards to be established on the basis of uni- 
axial tension creep tests. 


Material 


The specimens were cut from a 12-in. length of a 9*/, OD by 
4°/, ID sleeve forging. This material was reheated after forging 
to 1950 F and held at this temperature for 2 hours, after which it 
was quenched in water. The composition and room temperature 
properties are as follows: 


ponents used in the various tests are shown in Table 1. 


Cc Mn P Ss Si 
0.06 1.28 0.012 0.013 0.38 
Yield strength 

Tensile strength 
Elongation 
Reduction in area 
Vickers hardness 


Ni Cr Mo Cu 
11.21 17.18 2.25 0.22 
31000 psi 
79000 psi 
78.6 per cent 
80.9 per cent 
140 DPN 


Contributed by the Metals Engineering Division and presented at 
the Metals Engineering Conference, Albany, N. Y., April 29-May 3, 
1959, of Toe American Socrety or MECHANICAL ENGINEERS. 

Note: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, March 18, 
1959. Paper No. 59—Met-14. 
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Test Specimens and Testing Conditions 


The uniaxial creep tests were run on a 0.505-in. diameter bar 
with a 3-in. gage length. The tubular specimens had a 6-in. 
cylindrical gage length. The outside diameter in all cases was 
1'/, inches. The inside diameter of most of the tubes was */;. in. 
but for a few it was */s in. 

Two types of combined stress loadings, pure internal pressure 
and equal biaxiel tensions, were used. In the equal biaxial ten- 
sion tests an axial load was added to the internal pressure to make 
the average axial stress equal to the average tangential stress. 
Since, under creep conditions, the stresses all vary with the radius 
the local axial and tangential stresses were not equal. 

The tests were run in the testing machine shown in Fig. 1. 
This photograph shows a specimen in the machine without the 
furnace. The axial load for the biaxial tension tests was main- 
tained by keeping a constant deflection of the spring weighbar at 
the lower part of the machine. The internal pressure was sup- 
plied by an air-operated pump. The pressure fluctuations were 
held to a minimum by providing the pressure system with a 


Table 1 


Type of test 
Uniaxial tension, solid 

cylinders 0 
Pure internal pressure, 

OD/ID = 2 3 
Biaxial tension, OD/ 

ID =2 1 
Pure internal pressure, 

OD/ID = 3 l 4 
Biaxial tension, OD/ 


=3 1 


or/ Os 


l:1:—1 


calibrated leak. By adding a chamber filled with oil at room tem- 
perature to the high-pressure system and by providing a leak of 
one drop of oil about every 5 seconds through a capillary tube a 
continuous motion of the actuating piston was assured. This 
reduced the friction effect and kept the driving piston from 
‘freezing’ to the cylinder wall. The specimen was connected to 
this high-pressure system by heavy-walled tubing of small diame- 
ter. Thus the high-pressure water at room temperature in the 
pumping system became high-pressure steam in the specimen in- 
side the furnace. 

Special precautions had to be taken for protection against the 
explosive conditions at fracture. A reinforcing tube of stainless 
steel was put inside the furnace as a liner to protect the furnace 
windings. The outside of the furnace was covered with heavy 
steel tubing. 

In the equal biaxial tension tests the axial strain was measured 
with an extensometer attached to the shoulders of the specimen as 
shown in Fig. 1. Some of the tests were stopped from time to time 
and the specimen removed and the increase in diameter measured. 
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These measurements were necessary to determine the distribu- 
tion of the strains in the tube wall. 

In the pure internal pressure tests the increase in diameter was 
measured by a lateral extensometer with a 10X magnification. 
The lengths of these specimens were checked before and after 
test to determine whether or not any creep occurred in the axial 
direction. 


Stress and Strain Distribution 


The stress distribution in a heavy-walled cylinder depends upon 
the strength or creep theory under consideration. In any event 
progress can only be made by measuring or calculating the 
strain distribution. The first problem then is to attempt to de- 
termine the distribution of the strains or more particularly the 
distribution of the strain rates. This can be done with a minimum 
of measurements as long as the material deforms uniformly. Any 
such analysis will begin to deviate from the actual behavior as 
soon as an ultimate load is reached in a tension test or when the 
third stage of creep begins in a creep test. 

In a heavy-walled tube under internal pressure the radial and 
tangential strains at the bore are much larger than those at the 
outside surface. It appears advisable, then, to evaluate the 
strain distribution in terms of the natural strains é, although it 
may be necessary later to revert to the conventional strain ¢. 

The strains considered here are the total strains. There has 
been no attempt to separate the elastic and the plastic com- 
ponents of the strain. After the specimen is loaded in a creep test, 
however, all the measurable strain is plastic, and so the analysis 
should be considered to be in terms of plastic strain with the elas- 
tic component neglected. 

The natural strain components for conditions existing in the 
present tests (€, # f(r)) can be expressed as 


1 
& = 2 In (+ (1) 
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where Equation (3) is a definition of the constant c; in terms of 
the known or assumed strain in the axial direction ~, and where 


(4) 


In these expressions for the strain components, r is the radius 
measured in the undeformed tube, } is the original outside 
radius, and p, is the radial displacement of the outside of the tube. 

When the tubular tests were interrupted for measurement of the 
axial strain and the increase in diameter, these measured values 
were used to calculate the distribution of the strain components. 
In Fig. 2 the strain distribution in a typical specimen after 358 
hours is shown. For this example the axial strain é, was 0.017 and 
the increase in outside diameter was 0.010 in. (p, = 0.005). In 
this specimen, which is typical of several so examined, the axial 
strain and increase in diameter was rather low. These low values 
were chosen deliberately to try to show the correct strain dis- 
tribution during second stage creep. If readings had been used 
from specimens that were near the rupture point, the average 
values of axial strain or diameter measurements would not have 
given representative results. 

Another reason for making the analysis while the strains are 
small is the change that takes place in the stress ratios as deforma- 
tion proceeds. The tangential stress increases more rapidly than 
the axial stress as the inside diameter increases during the strain. 

As shown in Fig. 2, the radial and tangential strains are much 
higher at the inside radius than at the outside surface. The ef- 
fective strain which is defined as 


C2 = (b + p,)? — cb? 


= y2 — + (, — + — (5) 
so that ér: reduces to ~, for pure tension also varies with the 
radius and is greatest at the inside surface r = a. Under steady- 
state conditions such as are encountered during the second stage 
of creep the effective strain rate will vary with the radius in the 
same manner as the effective strain. 

In a thin-walled tubular specimen it is quite logical to use 
average values of the radial, axial, and tangential stresses to cal- 
culate an effective stress. It is quite logical to use the strains at 
the mean diameter to calculate an effective strain or effective 
strain rate. In the present analysis we shall develop expressions 
for the stress components and the effective strain rate as func- 
tions of the radius in order to compare the tubular tests with the 
uniaxial tension tests. 


.08 
.06 “4 
.04 4 
4 
4 fm 'e 
30 .40 .45 55 
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-.02- 
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-.06 


Fig. 2 Strain distribution in a typical specimen after 358 hours at load 
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For the case of a thin-walled tube under equal axial and tan- 
gential stresses the radial strain should be negative and equal to 
twice the axial or the tangential strain. It would be logical to 
compare these strains at the mean radius. In Fig. 2 it will be 
noted that at the mean radius r,, the tangential strain & is greater 
than the axial strain ¢,, and it appears that some other radius 
should be chosen as an equivalent radius. Since the effective 
strain [Equation (5)] includes the three principal strains in its 
make-up, it might be more logicai to choose that radius which 
divides the cylinder into equal volumes. This radius we will call 
the equivalent radius r,. 


(6) 


We will hope, then, that this equivalent radius will be the radius 
at which the axial and tangential strains are equal for the case 
where the average axial and average tangential stresses are equal. 
This choice of equivalent radius does not have much theoretical 
background and is only suggested because some such assumption 
must be made in order to proceed with the analysis of the stress 
distribution. The necessity of this assumption will become evi- 
dent later when the stress distribution is investigated. 

A corresponding assumption must be made for the case of the 
pure internal pressure tests. In a thin-walled tube under pure 
internal pressure it can be shown theoretically and experimentally 
that there is no extension in the axial direction. In the thick- 
walled tube this is not so obvious but it might be assumed as a be- 
ginning for a trial and error solution. The calculated and meas- 
ured resultant end loads could then be compared as a boundary 
condition. 

The analysis so far has been in terms of natural strain. It has 
been found, however, that in order to determine the stress dis- 
tributions it is too difficult to proceed with the more acceptable 
strains and that for convenience sake it is advisable to use con- 
ventional strains. It will be obvious later when the actual creep 
curves are presented that the strains are rather small in the second 
stage creep range, and therefore the analysis based on conven- 
tional strains will be sufficiently accurate. 

In order to determine the stress distribution from the measured 
and computed strain rate distribution it is necessary to adopt 
some governing creep law. Such a law might be expressed in 
terms of the effective stress o.1; and effective strain rate u = 
O€er1/Ol. These variables are defined by the relationships 

1 

on = V2 — o,)? + (o, — o,)? + (o, — (7) 

Uett = v [(ug — u,)? + (u, — u,)? + (u, — ug)?|'/* (8) 
wher the stresses are based on the original dimensions and the 
strain rates are the time derivatives of the conventional strains. 

Either a power function relationship or a hyperbolic sine rela- 
tionship might be used. In this analysis the power function has 


been used because it leads to an expression that can be integrated 
without difficulty for the case of zero axial extension. 


de. 
‘= Boe" (9) 
ot 


Uett = 


The relationships between the components of the strain rate 
and the stress components are assumed to be 


(10) 


(11) 
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(12) 


where W is some function of the stress and hence some function 
of the radius. From Equation (12) the axial stress is 


o, + % 


o, = pu, + 3 


(13) 

and when this is substituted in Equation (7) the effective stress 
becomes 

1 

Fen = — + (14) 


Since the sum of the three strain rate components is zero, the 
effective strain rate can be written as 


9 
Uett = [ue? + ugu, + u,?}'/? (15) 
v3 
The main reason for reverting to the conventional strains is to 
be able to use the compatibility equation for small strains 


6 
r—+e-—é =0 


(16 
> ) 


The use of this equation to determine the corresponding expres- 
sion for the strain rates will lead to a more simplified expression 
for the effective strain rate than would be obtained from Equa- 
tions (1), (2), and (3). 

The compatibility equation for the strain rates is obtained from 
that of the strains by differentiating it term by term 


r—+wu—u, =0 (17) 
or 6 


Since u, can be expressed in terms of ug and u,, which does not 
depend upon r, Equation (17) can be integrated. The expressions 
for the strain rates then become 


ug + (18) 


(19) 


The effective strain rate can now be expressed in terms of the 
radius and the axial strain rate. 


(20) 


= Var + 3u,?r*] 


Equations (10) and (12) can be combined with Equation (18) 
to produce a relationship between the stress difference (a — ¢,) 
and the radius r. 


3 Cs 


+ 4y r? 


(21) 


from which 


— o,)r* 


22 
(22) 


and when this is substituted in Equation (14) the effective stress 
becomes 


V 5 


= [4es* + 3u,*rt]'/* (23) 
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When Equations (20) and (23) are now substituted in the creep 
law, Equation (9), we obtain a final expression for the stress dif- 
ference (a@ — @,). 


n—1 (24) 


B¥n(3) 2” + 3u,2r4) 


— = 


The equilibrium equation for radial symmetry is 


oo, 
(25) 
or 


The radial stress ¢, can be found by substituting Equation (24) 


in the expression 
(99 — 
o, = ——— dr (26) 
b r 


Equations (26) and (24) are used to determine the radial and the 
tangential stresses. The axial stress can then be determined from 
Equations (22) and (13). 

It will be noted that these expressions contain the axial strain 
rate which is known for the biaxial tests and is assumed zero for 
the pure internal pressure test. The constant c;can be evaluated 
if some relationship between ug and u, is assumed and if ug at 
some given value of r is known. The procedure to be followed, 
then, depends upon the type of loading. For the pure internal 
pressure tests it is assumed that u, is zero and that ug at r = b 
can be obtained from the creep curves. For the biaxial tests it 
is assumed that ug = u, at some given value of the radius (say 
r =r,). The constant c; can then be determined in terms of u, 
which can be determined from the creep curves. 


Creep Tests in Uniaxial Tension 


The data from the uniaxial tension tests are listed in Table 2. 
The minimum creep rates are plotted in Fig. 3. The exponent for 
the power function relationship of Equation (9) is n = 6.29. 
This particular lot of type 316 stainless steel was rather soft. In 
each specimen plastic deformation occurred on loading before 
the stress reached 12000 psi. 

Only 3 of the specimens fractured. There was very little 
localized necking, but many cracks opened in the vicinity of the 
fracture. This accounts for the low values of reduction in area in 
otherwise ductile material. It is interesting to note that at a 
stress of 18300 psi the transition point occurred at 4 per cent 
strain and at 220 hours, while at a stress of 11450 there is no indi- 
cation of third stage creep after 10200 hours and 4.9 per cent 
strain. 


Table 2 Test data for uniaxial tension creep test 


Creep Tests on Tubular Specimens 


Of the 10 tubular specimens tested, 5 were loaded with pure 
internal pressure and 5 with equal biaxial tensions. In the biaxial 
tension tests the average axial stress was equal to the average 
tangential stress. The creep curves for the biaxial tension tests are 
shown in Fig. 4 while those for the pure internal pressure tests 
are shown in Figs. 5 and 6. Specimens T-8, T-9, and T-10 are 
extra heavy-walled tubes. The extensometers could not be left 
on the specimens until fracture occurred, hence the rupture 
points are not generally shown on the creep curves. The data 
relating to these tests are shown in Table 3. 

Since there is localized flow in both the axial and the tangential 
directions in the neighborhood of the fracture, the strain at this 
point is not easily determined. The fracture strains listed in this 
report have been obtained in the following manner. After frae- 
ture the wall thickness at the edge of the rupture was measured. 
From these measurements an average conventional radial strain 
was calculated, and from this conventional strain the correspond- 
ing natural strain was determined. For the tests under biaxial 
tension it was assumed that the strain in the axial direction was 
equal to the strain in the tangential direction and that each 
was equal to one half the average radial strain but of opposite sign. 
The average radial strain then corresponds to the effective strain 
at rupture. For the pure internal pressure test it was assumed 
that the axial strain was zero. The tangential strain at fracture 
then is equal to the radial strain but of the opposite sign. These 
average values were then substituted in Equation (5) to obtain 
the effective strain at fracture. The values of effective strain at 
rupture listed in Table 3 appear to be erratic, and this, no doubt, 
is due to the large number of cracks in the vicinity of the fracture. 
Some idea of the cracking near the fracture may be gained from 
the photograph of a ruptured specimen shown in Fig. 7 


a 

a 

25, 

Q 25 

20} 

2 

he) 
2 
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Users - Effective Creep Rate |/hr 


Fig. 3 Stress versus creep rate for uniaxial tension tests on type 316 
stainless steel at 1200 F 


Type 316 stainless steel 


Specimen no. 1 2 
Temperature, deg F 1200 1200 
Stress, psi 23400 18300 
Rupture, hr 213 1248 
Rupture strain, per cent 28.0 39.3 
Minimum creep rate, 1/hr 4.4 10-4 
Reduction of area 29.7* 31.9° 
Time to 0.5% strain, hr 0 0 
Time to 1.0% strain, hr 0 0 
Time to 3.0% strain, hr 5 145 
Transition point, hr 55 220 
Transition strain, per cent 6 4 
Original Vickers hardness 146 139 


Effective strain at rupture, per 
cent® 


35.2 38.5 


* Natural strain, based on reduction in area. 


3 4 5 6 
1200 1200 1200 1200 
28300 15300 10450 11450 
37.5 >620 >868 > 10200 
21.5 

2.8 10-3 3.1 10-* 5.2 xX 
23. 8° 

0 0 1000° 800 

0 125 ahs 1800 
1 625° a 5820 
15 

9 vr >4.9 
139 140 150 155 
27.2 


* Reduction in area figures are low due to the large number of voids opened within the material. 
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Evaluation of Stress Components 


analytically. 
tegrated numerically. 


The stress distribution in the tubular specimens was calculated 
according to the equations developed earlier in this paper. 
the pure internal pressure tests, u, in Equation (24) was assumed 
to be zero which made it possible to integrate Equation (26) 
For the biaxial tension tests Equation (26) was in- 
The summation was made by starting at 
r = b where a, was known to be zero. 

The first step in evaluating the stresses was the evaluation of 
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. Fig.4 Creep of type 316 stainless steel at 1200 F, biaxial tension 
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Fig. 7 Typical fract 
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Fig.5 Creep of type 316 stainless steel at 1200 F, pure internal pressure Fig.6 Creep of type 316 stainless steel at 1200 F, pure internal pressure 


For 


of 


the constant n in Equation (9) from the uniaxial tension tests. 
The value is obtained from the slope of the curves in Fig. 3. 
There may be some arguments in favor of determining the value 
of both B and n from the uniaxial tension tests. If both con- 
stants were determined from Fig. 3, the calculated points for the 
biaxial tests would fall directly on the line drawn in Fig. 3, and 
any discrepancy between test results and theory would show up 
as a difference between a calculated and a measured internal pres- 
sure or axial load. Since both the effective stress and the effective 
strain rates vary through the wall of the tubular specimen, the 
results of a test on such a specimen will show up as a segment of 
a line or curve rather than as a point in a plot like that of Fig. 3. 
In the present analysis the exponent n was determined from the 
uniaxial tension tests while the constant B was allowed to take 
on the value that would make the calculated end load equal to the 
measured axial load. The result of each tubular test will then 
show up as a straight line segment parallel to the curve for the 
uniaxial tension tests in a plot like that of Fig. 3. 

The calculation of the stresses in a biaxial tension test can then 
be carried out in the following manner. The constant c; is 
evaluated in Equation (18) by assuming that ug is equal to the 
known u,atr =r, The effective strain rate u.r can then be ob- 
tained as a function of the radius from Equation (20). Since the 
value of the constant B in Equation (9) is not to be evaluated 
from the uniaxial tension tests, and since by Equation (18) the 
constant c,; is proportional to the axial rate u,, all of the constants 
in Equation (24) including the axial rate u, can be grouped in one 
unknown constant A and this equation can be written as 


— @,) 
—— = fir) 


where f(r) is a known function of the radius. Integration of 
Equation (26) then yields the value of o,/A and the constant A 
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can be evaluated by setting ¢, equal to —patr =a. The tan- 
gential stress og is then obtained from Equation (24), and the 
axial stress o, from Equations (13) and (22). The effective 
stress d.¢ can then be evaluated from Equation (23). 

If the calculated end load obtained from integrating the axial 
stress over the cross-sectional area agrees with the load deter- 
mined by the applied tensile load and the internal pressure, the 
stress distribution is assumed to be correct. If the two loads do 
not agree, a new estimate of the radius at which the tangential 
and the radial strain rates are equal must be made and the work 
repeated until agreement between the two loads is reached. A 
comparison of the measured and calculated loads is shown in 
Table 4. Where two listings are shown for one specimen, the 
first set of values represent the original estimate of r* the radius 
at which it was assumed that ug was equal to u, and the second 
represents an improved estimate of r*. 


Table 4 Comparison of measured and calculated axial loads 


Meas- 
ured 
load 

3840 

3640 

3840 
11950 

11950 

9975 

10920 

9000 

2660 
10200 

2330 


ternal 
Pressure load 
15000 0 
14000 0 
13400 0 
16000 7950 
16000 7950 
13300 6650 
14600 7270 
12000 6000 
24000 0 
23000 7650 
21000 0 


Caleu- 
lated 
load 
3820 
3645 
3730 
12600 
11600 
9850 
10900 
8950 
2685 
10170 
2430 


Specimen 
316 T-1 
316 T-2 
316 T-3 
316 T-4 
316 T-4 
316 T-5 
316 T-6 
316 T-7 
316 T-8 
316 T-9 
316 T-10 


0.446 
0.455 
0.446 
0.445 
0.446 


0.446 
0.446 
0.446 
0.445 
0.446 


0419 0.419 


In this table r* is the radius at which it was assumed that ug was 
equal to u,. 


The agreement of the calculated and measured values of the 
axial loads for the pure internal pressure tests was fairly good as 
can be seen in Table 4. These values, however, could be made to 
check more closely by assuming a value other than zero for the 
axial strain. This has not been done, however, for no measurable 
amount of axial strain was observed in any of these specimens. 

Typical stress distribution curves for a biaxial tension test and 
a pure internal pressure test are shown in Figs. 8 and 9. These 
curves are for the specimens with the heaviest walls and the high- 
est pressures. For the pure internal pressure test the three stress 
components gradually increase with the radius. The maximum 
tensile stress is the tangential stress at the outside radius. For 
the biaxial tension tests the axial stress was less than the tangen- 
tial stress at the inside surface but increased more rapidly and 
became greater than the tangential stress at the outside surface 
where it was the greatest tensile stress in the tube. 


Comparison of Uniaxial and Biaxial Results 


The minimum creep rates for the uniaxial tests shown in Fig. 3 
are replotted in Figs. 10 and 11 and the effective stresses and 
strain rates for the tubular specimens have been plotted there 
also. 

The data are plotted on two curve sheets for ease in reading. 
The tubular test results show lower strain rates than would be 
predicted from the uniaxial tension tests. This discrepancy ap- 
pears to become greater as the stress is reduced. As far as mini- 
mum creep rate is concerned, there seems to be no significant dif- 
ference between the results of the pure internal pressure and the 
biaxial tension tests. For example, in Fig. 11, the specimens 
which represent the best and the worst agreement with the uni- 
axial tension tests were both loaded by pure internal pressure. 

The manner in which the results of the tubular specimens de- 
viate from the results of the uniaxial tension tests is not fully 
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Fig.8 Stress distribution in thick-walled specimen T-8, type 316 stainless 
steel at 1200 F, pure internal pressure 
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Fig. 9 Stress distribution in thick-walled specimen T-9, type 316 stain- 
less steel at 1200 F, biaxial tension 
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Fig. 10 Comparison of uniaxial and biaxial creep tests type 316 stain- 
less steel at 1200 F 
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Fig. 11 Comparison of uniaxial and biaxial creep tests type 316 stain- 
less steel at 1200 F 


o 


understood and no adequate explanation can be given at this 
time. If the tubular specimens were to be analyzed without the 
uniaxial tension tests, it appears that a better fit would be ob- 
tained by using a larger value of n. 
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Fig. 12 Comparison of rupture times, type 316 stainless steel at 1200 F 


Comparison of Rupture Times 


A plot of effective stress versus rupture time can be used to 
compare the rupture properties of the biaxial and the uniaxial 
tests. Such a curve is shown in Fig. 12. Each rupture of a tubu- 
lar specimen is represented by a straight vertical line. The upper 
end of the line represents the stress at the inside surface. If the 
effective stress at the outside surface were used as a means of com- 
parison, the tubular specimens would have a low rupture life. 
If the comparison were based upon the stress at the inside surface, 
the tubular specimens would have a longer life than the uniaxial 
specimens. The stress at the equivalent radius is not much higher 
than it is at the outside surface. Thus if the stress at the equiva- 
lent radius were used for comparison purposes, the tubular speci- 
mens would still have an unfavorable life. This shorter life for 
the tubular specimens is partly due to the large increase in tan- 
gential stress during the third stage of creep. 


The Appearance of Fracture 


The fracture shown in Fig. 7 is typical of all the biaxial tension 
fractures. The fractures in the specimens loaded with pure in- 
ternal pressure were quite similar except that in those cases all 
the small cracks were longitudinal. The fractures all started as 
grain boundary cracks at the outside surface. The oxide coat- 
ings on the fracture surfaces show that the surface cracks that 
ultimately lead to failure progress about ?/; of the way through 
the wall at a slow rate before catastrophic failure occurs. 

Examination of the outside surfaces of the fractured specimens 
indicates that practically the entire outside surface contains 
small grain boundary cracks. The inside surfaces also contained 
some longitudinal cracks, but these were not as deep as the ex- 
ternal cracks and they may not have formed until the time of the 
actual fracture when the tube wall is subjected to rather violent 
bending. 

The fact that the destructive cracks started at the outer surface 
may be explained by the higher tensile stresses appearing there 
as shown in the stress distribution curves. This higher tensile 
stress seems to overshadow the higher effective stress and the 
greater strain that occurs at the bore. 

_ It is quite possible that the atmosphere or environment had an 
effect on the grain boundary cracking. The outside surface was 
exposed to an air atmosphere, while the inside surface was ex- 
posed to an atmosphere of superheated steam. It should be re- 
membered that there was no flow past this inside surface, how- 
ever, and the atmosphere there would not be typical of that in 
actual steam piping. Apparently, though, the steam atmosphere 
that was present was not particularly damaging. 

Another factor that may tend to cause the cracking to be more 
pronounced at the outer surface is the manner in which the hy- 
drostatic component of the stress varies with the radius. The 
hydrostatic component of the stress is the normal stress on the 
octahedral planes @oc and is given by the expression 
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3 3 


(28) 


This quantity has been evaluated for one of the extremely heavy- 
wall specimens, T-9. The results are plotted in Fig.9. The hy- 
drostatic stress is 8800 psi compression at the bore and 11000 psi 
tension at the outside surface. It is known that the ductility of a 
material is reduced if the ratio of maximum tensile to maximum 
shearing stress is increased. A triaxiality factor! which would be 
analogous to this ratio and which would contain the stresses in 
the three directions could be written as 
TF « V 2 Geet V2 
Toct — + (a2 — 93)? + (os — 
(29) 


The coefficient in this ratio is introduced to make the triaxiality 
factor equal to +1 for uniaxial tension. It then has the value of 
zero for torsion and —1 for uniaxial compression. In Fig. 9 this 
triaxiality factor is also plotted against the radius. It varies 
from a value of —1.15 at the bore to a value of +1.925 at the 
outside. Thus it would be expected that the material at 
the outside surface would be less ductile than that near the inner 
surface. 


Practical Use of Test Data 


Since the determination of the test data shown in Figs. 10, 11, 
and 12 required the integration of Equation (26), and since this 
in some cases had to be done numerically, it would be convenient 
if the data could be interpreted in a more simplified manner. 
This might be done if an effective stress based upon the average 
values of the three principal stress components would be used. 
Such an effective average stress could be defined as 


1 
= J/2 [(Faav) 


+ (Or(avy — Os(ev))? + (Cstav) — (30) 


Where the average values of the stress components for the tubular 
specimens are determined from the following simple relationships 


= (31) 
P b-a 
Cun = (32) 
pa* Ww 


Cxav) = + x(b? — a?) (33) 
In these expressions W is the applied axial load, p is the internal 
pressure, a is the inside radius, and b is the outside radius. The 
value of this effective average stress has been calculated and is 
listed in Table 3. In order for such a stress to be useful, it is 
necessary to relate it to some particular value of the effective 
strain rate. For comparison with the behavior under uniaxial 
tension the effective average stress Oavjers iS plotted against the 
effective strain rate at r = r, in Fig. 13. This type of plotting 
moves the test points for the tubular tests to the left or in the 
direction of more conservative design. For a given calculated 
value of the effective average stress the actual effective strain 
rate would be less than that calculated from uniaxial tension 
data. 


1 This factor has been introduced in a paper, “Stress Distribution 
and Plastic Deformation in Rotating Cylinders of Strain Hardening 
Material,”’ by E. A. Davis and F. M. Connelly, Journal of Applied 
Mechanics, vol. 26, Trans. ASME, vol. 81, 1959, pp 25-30. 
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Fig. 13 Plot of effective average stress versus minimum creep rate at 
the equivalent radius for type 316 stainless steel at 1200 F 
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Fig. 14 Comparison of rupture times, type 316 stainless steel at 1200 F 


This effective average stress can also be used in agtime to rup- 
ture plot as shown in Fig. 14. In this plot the rupture points for 
all the biaxial tension tests fall on the safe side of the curve for 
uniaxial tension. It is of interest to note that the points for the 
pure internal pressure tests on the extra heavy-walled tubes also 
fall on the safe side of the curve, but that the pure internal pres- 
sure tests on tubes with OD to ID ratio of 2 have shorter rupture 
times than would be predicted by the uniaxial tension tests. It 
should be pointed out, however, that the difference between the 
pure internal pressure test and the uniaxial tension tests de- 
creases as the stress decreases. Thus it would appear that the 
uniaxial tension creep rupture data are safe bases for design if the 
design stresses are calculated in terms of an effective average 
stress. This is particularly true, according to the present test 
data, if the design is to be based on creep rate criteria. It may 
not be completely true if rupture time is the basis for design, al- 
though it tends to come nearer to the true values of rupture time 
at smaller stresses. 

It might be of interest here to compare the effective average 
stress approach to the design of steam piping with the formula 
given in the ASME Boiler and Pressure Vessel Code. The expres- 
sion for the minimum wall thickness of steam piping as given in 
Paragraph P-23 of Section 1 on Power Boilers is 

pD 34) 

2SE + 2yp 

where ¢,, is the minimum wall thickness in inches, p is the maxi- 
mum internal pressure in pounds per square inch, D is the outside 
diameter of the pipe, E is the efficiency of a longitudinal weld and 
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will be considered here as unity, c is an allowance for corrosion 
and will be neglected in this comparison, y is a factor depending 
on the material and the temperature and for the present case is 
0.7, and S is an allowable stress due to internal pressure. The 
value of S for type 316 stainless steel at 1200 F according to the 
Boiler Code is 6800 psi. 

The maximum pressure for a tube whose outside diameter is 
twice the inside diameter can be determined from Equation (34) 
by letting D = 4t and solving for p with S and y as prescribed. 
The pressure so determined is 5225 psi. The effective average 
stress for such a tube as obtained from Equations (30)—(33) is 
6800 psi which agrees exactly with the value of S given by the 
boiler code 

According to the present test data and method of an lysis such 
a tube would creep at a rate lower than that of a uniaxial tension 
specimen at 6800 psi, but it might rupture in a slightly shorter 
time. 

For the extra heavy-walled tubes where the outside diameter is 
3 times the inside diameter the maximum allowable pressure as 
given by the Boiler Code formula is 8500 p.i. The corresponding 
effective average stress for this tube is 7400 psi which is con- 
siderably higher than the allowable value of 6800 psi. When com- 
pared with uniaxial data the effective average stress would then 
lead to a much more conservative design than would be obtained 
from the Boiler Code formula. The test data indicate that a tu- 
bular specimen of such cross section would creep more slowly and 
rupture later than a uniaxial tension specimen at 7400 psi. There 
may be some question whether or not it would behave better 
than a tension specimen at 6800 psi. 


Conclusions and Remarks 


The analysis of the combined stress creep results presented 
here has shown that both combined stress and uniaxial stress re- 
sults can be fitted to the same creep law. The analysis, however, 
did indicate a deviation of the combined stress results away from 
those for uniaxial tension. It is hoped that this deviation can be 
explained in the future. 

It should be remembered that the analysis presented here is not 
the only possible or even the most preferable analysis. As men- 
tioned earlier, the power function law was adopted because it led 
to expressions that could be integrated under favorable condi- 
tions. 

A better analysis might be made if an extensometer capable of 
measuring both the axial strain and the increase in diameter 
could be developed. 

In the present investigation the distribution of the strains 
could be completely determined by making use of certain theoreti- 
cal considerations. The general problem of strain distribution 
thus was not studied. The assumptions involved in adopting 
Equations (10) to (12) were less important in these tests than 
would have been the case if other ratios of axial to tangential load 
had been tested. 

The effective average stress as defined in Equations (30)—(33) 
provides a simple means of evaluating a state of combined stress 
in terms of uniaxial tension data and compares quite closely with 
the Boiler Code formula for tubes in which the outside diameter 
is twice the inside diameter. 

For the tubes with the heaviest wall, OD = 31D, the effective 
average stress leads to a more conservative design than that based 
on the Boiler Code formulas. 


1960 / 461 


| 

? * 

ss 
| 7 02 
4 

| 

6 8 
a 

|_| é ~ 

| 

a 
a 

£ 

4 


JAIN FINNIE 


Engineer, Shell Development Company, 
Emeryville, Calif. Assoc. Mem. ASME 


Steady Creep of a Tube Under Combined 
Bending and Internal Pressure 


It is shown that for a tube in bending the steady-state bending creep rate may be greatly 
increased by the addition of internal pressure. If the maximum bending stress is small 
compared to the pressure stress, a very simple expression is obtained for the ratio of 
bending creep rates with and without internal pressure. Experiments on lead tubes show . 


Ra MANY MATERIALS at elevated temperatures the 
curve of strain, as a function of time in a tension test at constant 
temperature, shows a well-defined region of steady-state creep. 
It is then often permissible for convenience in strain calculations 
to replace the strain-time curve by the straight line shown in Fig. 
1. The creep rate is then only a function of stress. The method 
of predicting strains under multiaxial stresses from these tensile 
data is fairly well established. Solutions have been given, for 
example, for thin and thick-walled cylinders under internal pres- 
sure [1],! tubes in bending [2], and rotating disks [3]. 


Creep Strain 
\ 


Time 


Fig.1 Schematic constant stress creep curve and straight line idealization 


An interesting case of combined stress creep occurs when bend- 
ing moments and internal pressure act together on a thin-walled 
tube. This problem was originally mentioned by Bailey [4] and 
his comprehensive 1931 creep paper, but has not received subse- 
quent attention in the literature. We will show that the bending 
strain rate under the combined loading may be considerably 
greater than that due to bending alone. Such a magnification, 
which of course does not occur with elastic stresses, may be of 
considerable importance in the design of high-temperature 
piping. 


Analysis of Strains 


As a starting point we assume that the steady-state creep rate 


€ and stress o in a tensile test are related by the empirical creep 
law 


€ = Bo" (1) 

' Numbers in brackets designate References at end of paper. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y., April 29- May 
3, 1959, of Tue AmeRICAN SocteTy oF MECHANICAL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, February 
3, 1959. Paper No. 59—Met-5. 
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good agreement with the theory. 


where B and n may be functions of temperature. The extension 
of Equation (1) to multiaxial strain prediction is given in many 
places in the literature (see, for example, reference [1]) and need 
only be outlined here. 


Briefly, one assumes that the stresses and strain rates in steady- 
state creep are related by the expression €* = Bo** in which the 
invariant quantities €* and o* are given in terms of the principal 
stresses and strains by 


1 
V2 — o2)* + — + — 


9 


Making the gdditional conventional assumptions that the ma- 
terial is incompressible during plastic deformation (creep) and 
that at any location the principal shear strain rates are propor- 
tional to and in the same direction as the principal shear stresses 
leads to two further equations 


4+&@+6=0 


and 


— & — & & — & 
0, — G2 


— G3 


Combining these two equations leads to 


; 2 1 
= 3 - 2 (o2 + 


and two similar expressions for é and &. Using the assumed re- 
lationship between stress and strain-rate invariants, this may be 
written 


é, = Ba**-1 ; (o2 + | 


For a tube under combined bending and internal pressure the 
principal stresses, o,, 7, 7,, lie in the tangential, axial, and radial 
directions. If we consider a thin-walled tube, and thus ignore ¢,, 


the axial strain rate €, at a given location may therefore be 
written as 


é, = Blo? — + * («. = (2) 
The tangential stress may be written directly in terms of the in- 
ternal pressure p, inside tube radius r, and tube wall thickness ¢ 
as o, = pr/t, but the distribution of the axial stress has yet to be 
determined. 
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We make the conventional assumption that plane sections re- 
main plane in bending.? The variation of axial strain rate over 
the cross section can then be written as 


= (3) 


where €, and €,; are the axial strain rates at distances y and y, from 
the neutral axis for bending. Later we will take y; as the distance 
to the outer fiber. If the corresponding axial stresses are a, and 
Oa1, Equations (2) and (3) combined lead to 


2 


n—1 


We can simplify the form of Equation (4) by writing 


+ bo, 


Ca = 


+ b,o, 


Since the axial stress due to internal pressure acting alone is o, 
= ¢,/2, we can interpret bo, as the amount by which the com- 
bined loading increases the axial stress due to pressure alone. It 
is not necessarily the stress that would be produced by the bending 
moment acting alone in the absence of internal pressure. 
ing Equation (5) in Equation (4) leads to 


Insert- 


(0.75 +b?) 


= b,(0.75 + b2) - (6) 
Two other equations may be obtained by considering force and 
moment equilibrium. Although a solution of these equations and 
Equation (6) is then possible in principle, it is not convenient in 
practice. 

A greatly simplified solution is possible for the very common 
practical condition in which the bending stresses are small com- 
pared to the tangential stress due to internal pressure. If this is 
the case, we can neglect b,? and 6? relative to 0.75 and Equation 
(6) becomes 


b b, (7) 


That is, the additional stress due to bending varies linearly over 
the cross section and hence may be calculated as if it were an 
elastic stress. On the other hand, if the bending stresses are large 
compared to pressure stresses, then the bending stress distribution 
approximates that due to bending acting alone. Robinson [2] has 
shown that the ratio of the maximum stress ¢, in creep bending of 
a circular cross section to the maximum elastic stress ¢, which 
would be produced by the same bending moment is 


r (: + 
8 1 — 2n 
(3+ 2) vs r (15+ 3) 
n on 
(8) 


where R; and R, are the inner and outer radii and T is the gamma 
function. 

The axial strain rate at the outside fiber distance y, from the 
neutral axis under combined bending and pressure is given by 
Equations (2) and (5) as 


? This is only strictly true for pure bending. For bending moments 
which do not vary too rapidly, we assume that the action of shear 
forces in distorting the otherwise plane section may be neglected. 
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és, = Bo,(0.75 + b, 


The strain rate €,, at the outer fiber due to bending acting alone 
would be 


= 


where b, = ¢,/a0,and o, may be obtained by combining Equation 
(8) and an elastic bending calculation. 

We can now write simple expressions for the ratio of the axial 
strain rates with and without internal pressure for two cases: 
Maximum bending stress small relative to tangential pressure 
stress (b;?< 0.75). 


(9) 


where ¢g,/¢, is obtained from Equation (8) and b, = @,/a;. 
Maximum bending stress large relative to tangential pressure 
stress (b,2 > 075). 


Experiments 


To study the use of the preceding equations, tubes were tested 
under combinations of internal pressure and bending at 120 F. 
The material used for the experiments was lead containing 6 per 
cent antimony. When tested in a constant temperature water 
bath at 120 F, this material was found to give very reproducible 
creep data. For the stresses studied (300 to 1000 psi) it exhibited 
a very well-defined steady-state creep between about 20 hours 
after loading and the end of the test at 100 hours. To decrease 
the possibility of metallurgical changes during the test the speci- 
mens were held at 160 F for several hours before being tested at 
120 F. While lead is far from a high-temperature material, its use 
has many advantages. Its creep curves at low temperatures and 
stresses are reasonably similar to those of heat resistant alloys at 
high temperatures and stresses. Methods of multiaxial stress 
analysis based on tension data, if they are to be generally ap- 
plicable, should apply to all creeping materials provided the ten- 
sion creep curves are reasonably similar. Strain measurements 
during the test were made by wire resistance strain gages glued 
to the specimens. Comparing the strains of specimens with and 
without strain gages showed that the stiffening effect of the gage 
could be neglected. 

For a time scale in hours and stresses in psi, the expression fitted 
to the tension test data was 

é = 8 X (11) 
It is perhaps worth pointing out that an expression such as Equa- 
tion (11) may be obtained from a single bending test with strain 
gages at several locations rather than from a number of tension 
and/or compression tests. 

Fig. 2 shows strain readings for three axial locations on a tube 
loaded first by bending moment alone and later by the same bend- 
ing moment combined with internal pressure. Since most of the 
data points lie within the widths of the lines in Fig. 2, they are 
not shown. The maximum tension and compression creep rates 
at the three cross sections for the two loading conditions are shown 
in Table 1. Also shown in Table 1 are the calculated maximum 
tensile stresses for elastic bending and creep bending and the 
ratios of these stresses to the tangential pressure stress (i.e., b, 
and 6.). In this particular test, the stresses were chosen so that 
the values of 6 would lie between those for which Equation (9) 
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Fig. 2 Strain as a function of time for a tube in bending 


Internal pressure added at 46 hours. Gages 17 and 18 lie in the tan- 
gential (circumferential) direction. The others lie in an axial direction. No 
distinction is made between tensile and compressive strain. Note that 
Gage 18 reverses sign at 56 hours. 


(b,? < 0.75) and Equation (10) (b,2 > 0.75) are strictly ap- 
plicable. 

In Table 2 the ratio of axial creep rates with and without in- 
ternal pressure is given. The average values from tension and 
compression gages may be compared with values predicted by 
Equations (9) and (10). This table shows very satisfactory agree- 
ment between experiment and analysis. For Location 12, 13, at 
which the bending stress is small relative to the pressure stress, we 
see that Equation (9) gives close agreement with the experimental 
value. As the bending stress increases, the ratio is overestimated 
by Equation (9) and agrees more closely with Equation (10). 


Summary 


The steady-state axial creep rate of a tube in bending may be 
greatly increased by the presence of internal pressure. When the 
maximum bending stress is small compared to the pressure stress, 
the bending stress under the combined loads can be shown very 
linearly over the cross section and thus may be calculated as if 
it were an elastic stress. As the bending stress increases, the 
bending stress distribution under combined bending and pressure 
approaches that corresponding to creep bending. For these two 
extreme cases, the factor by which the bending creep rate is in- 
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Table 1 Axial creep rates and stresses corresponding to Fig. 1 


Elastic bending Creep bending 
Ge, PSi be = Oc, Psi be = 


650 0.929 547 0.782 


515 0.735 434 0.620 


355 0.507 298 0.425 


Table 2 Ratio of axial creep rates with and without internal pressure 
(Axial creep rate under bending and pressure ) 


(Axial creep rate under bending only) 


Location Experimental Equation (9) Equation (10) 
19 2.19 
16 2.34 
Average 16, 19 2.26 2.41 2.31 


14 3.02 
15 2.50 
Average 14, 15 2.76 2.87 2.56 
12 4.76 
13 3.54 
Average 12, 13 4.15 4.13 3.30 


creased by internal pressure is given by Equations (9) and (10), 
respectively. For any intermediate case such as Location 14, 15 
of Tables 1 and 2, the factor lies between the values given by 
Equations (9) and (10). 
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DISCUSSION 
K. R. Merckx® 


The special case of creep bending of pressurized tubing under 
small bending moments may also be physically analyzed in the 
following manner: The axial strain and stress deviation of 
pressurized tubes are zero. The addition of small bending 
stresses, 0,,< 0 where 0 = flow criteria, will not appreciably 
alter the flow criteria, but will cause these axial strain or strain 
rates 


9 

° <0, o 

€, or €, ~ f(a) -, 
« eff 


where f(a) is any yielding function. The effective modulus 
E.« can be experimentally determined as the ratio of the creep 
stress o divided by the creep strain € measured at the expected 
design life. Since for small bending stresses E.,; is a constant 
for given operating pressure, elastic methods of calculation can 
be used to determine relative bending deflections. This argument 
fails for bending stresses which are not small compared to the 
pressure stresses; thus methods similar to those described by 
the author must be used. The author’s recognition of the effect 
of small bending stresses on creeping pressurized tubing is an 
important factor which is of use in many engineering applica- 
tions. 


* Hanford Laboratories Operation, General Electric Company, 
Richland, Wash. Mem. ASME. 
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Effect of Wall Thickness on Stress- 
Rupture Life of Tubular Specimens 


Stress-rupture data obtained from tubular specimens stressed with internal pressure 
are compared with data from standard tension-bar specimens from the same heats of 


material. Agreement between the data of the two types of specimens is poor for thick- 
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Introduction 


* THE DESIGN OF PRESSURE VESSELS and tubes 
that operate at high temperatures, stress-rupture is one of the 
more important properties of materials to be considered. Stress- 
rupture data indicate to the designer what service life can be ex- 
pected of a material subjected to a specific stress and temperature 
before failure occurs. For this reason, stress-rupture properties 
of materials have been studied for many years, and abundant data 
on most commonly used materials are available in the literature. 

Most of these data were obtained from cylindrical specimens 
machined from bar stock and loaded in pure tension, a condition 
which is rarely duplicated in actual service. These bar speci- 
mens do not account for metallurgical changes in the material 
which may occur during fabrication of parts, and the uniaxial 
tension stress does not duplicate the biaxial or triaxial stresses 
normally found in pressure vessels and tubes. Furthermore, al- 
though the initially applied stresses may be the same, the dif- 
ferences in the effect of creep on a bar specimen and on a tube can 
produce a difference in stress over a period of time. 

For these reasons, the authors’ company began a test program 
ten years ago to investigate the stress-rupture properties of ma- 
terials in tubular form, stressed by internal pressure. Some of 
the early results have been reported in a previous paper.'. The 
results of these continuing tests have proved to be quite fruitful 
and have provided information that could not be obtained 
by other methods. The tests have yielded information on the 
effect on stress-rupture life of physical and metallurgical discon- 
tinuities and stress concentrations, such as those occurring in 
welded tubing and tube bends. Also, since the tests included a 


1 L. F. Kooistra, R. U. Blaser, and J T. Tucker, Jr., ‘“‘High-Tem- 
perature Stress-Rupture Testing of Tubular Specimens,’’ Trans. 
ASME, vol. 74, 1952, pp. 783-791. 
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at the Metals Engineering Conference, Albany, N. Y., April 20-May 
1, 1959, of Tae American Society oF MECHANICAL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, February 
24,1959. Paper No. 59—Met-11. 


wall tubes when the hoop stress in the tube wall is calculated on an average stress basis. 
Better agreement is obtained with thin-wall tubes. 


These results show that any design 


formula that neglects the variation of stress through the thickness of a tube wall is un- 
satisfactory for thicker tubes. 

A dimensionless parameter method is presented which provides satisfactory correlation 
of the data for three materials, four test temperatures, and a wide range of wall thickness. 
Several areas in which additional research would be of value are discussed. 


wide range of outside-diameter-to-thickness ratios for several 
materials, the opportunity was afforded to observe the effect of 
tube wall thickness on stress-rupture life. 

Obviously, the stress-rupture properties of the material are not 
changed as the wall thickness is varied. Therefore, the apparent 
change in rupture properties reflects the inability of the method of 
calculating stress to predict the stress condition that causes 
failure. It is shown that better correlation of bar specimens and 
tubular results can be obtained when the stress in the tube is cal- 
culated by a method that predicts a stress gradient in the tube 
wall, rather than assuming an average stress. The data for 
all wall thicknesses, materials, and temperatures covered in the 
tests are correlated reasonably well by use of dimensionless param- 
eters. In this way all the results are compared and there is no 
need to resort to stress formulas which may not be theoretically 
applicable. 

The combination of loads imposed on tubing in service varies 
considerably, depending on the design of the structure in which 
they are used. The data presented herein are applicable to only 
one common condition of loading, and further work is desirable 
to determine the effects of other combinations. 


Materials and Apparatus 


Three typical superheater materials were used in the tests re- 
ported in this paper: 


1 Carbon steel, SA-210 
2 Low alloy steel, SA-213, T22 (2'/,-Cr 1-Mo) 
3 Austenitic steel, SA-213, TP321 (18-Cr 12-Ni Ti) 


Their physical properties, chemical compositions, heat-treat- 
ments, and ASTM specifications are listed in Tables 1, 2, and 3. 
It is apparent that the specimens were representative of their 
respective specifications. All of the tubing of each type and the 
bar specimens with which they are compared were taken from 
the same heat of material and were heat-treated together so that 
their physical properties and microstructure were similar. 
Tubular Specimens. A typical tubular stress-rupture specimen 
is shown in Fig. 1. It consists of a 17-inch length of tubing whose 


Nomenclature 


S = average hoop stress, psi 
S,, = maximum hoop stress, psi 


S, = stress required for rupture in a bar 
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specimen in a specified time, psi 

P = internal pressure, psig 
P; = internal pressure required for rup- 
ture in a tubular specimen in a 


specified time, psig 
D; = inside diameter, inches 
D, = outside diameter, inches 
¢t = tube wall thickness, inches 
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Table 1 Physical properties 


Ultimate Yield point, Elongation, 
Material Heat no. strength, psi psi per cent 
2'/,-Cr 1-Mo thick-wall tube H1913L 68700 37000 
2'/,-Cr 1-Mo thin-wall tube 924 65100 47300 
2'/,-Cr 1-Mo ASTM Spec. 60000 25000 
Type 321 thick-wall tube 20526 80900 50000 
Type 321 ASTM Spec. +e 75000 30000 
SA-210 thick-wall tube T1088 63000 43500 
SA-210 ASTM Spec. oe 60000 37000 


Table 2 Chemical composition, (per cent) 


Material Heat No. C Mn P 5s Si Ni Mo 

1-Mo thick- 

wall tube H1913L (0.09 0.41 0.022 0.018 0.29 2.13 ; 0.98 
2'/,-Cr 1-Mo thin- 

wall tube 924 0.10 0.39 0.020 0.019 0.27 2. dans 0.97 
2'/-Cr 1-Mo ASTM 

Spec. ee 0.15 0.30-0.60 0.030 0.030 0.50 90-2.6 - 0.87-1.13 

max max max max 

Type 321 thick- 


wall tube 26 0.065 1.79 0.010 0.025 0.42 2.36 
Type 321 ASTM 
Spec. 0.080 2.00 0.030 0.030 0.75 17.00-20.00 9.00-13.00 
max max max max max 
SA-210 thick-wall 
tube T1088 0.232 0.46 0.010 O01 0.33 Small residuals 
SA-210 ASTM Spec. oka 0.27 0.80 0.058 0.048 0.10 ae 
max max max max min 


Table Heat-treatment 


~ Material Heat no. Heat-treatment 
AN 2'/,-Cr 1-Mo H1913L 1550 F for 1 hour 
g 


SOLID STEEL : 25 F per hour furnace cooled to 
1200 F 
2'/,-Cr 1-Mo 924 Furnace cooled (maximum rate) 
to room temperature 
Type 321 20526 1900 F for '/. hour 
Water quench 
SA-210 T1088 1650 F for '/, hour 
Air cooled to below 400 F 
Fig. 1 Tubular stress-rupture specimen 1300 F for '/: hour 
Air cooled to room temperature 


Fig. 2 Exterior view of stress-rupture apparatus Fig. 3 Interior view of stress-rupture apparatus 
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Fig. 5 Arrangement of one unit of multiple-station stress-rupture apparatus 
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Fig.4 Typical section through one unit of stress-rupture apparatus 


ends are closed by welding on closure heads. Before the heads 
are welded, a solid steel filler piece is inserted into the tube to re- 
duce the internal volume and thus minimize the explosive forces 
when rupture occurs. For most tests, the specimens were made 
from standard commercial tubing with no machined surfaces or 
other refinements; however, the specimens used to study the 
effect of varying the diameter-to-thickness ratios were accurately 
machined on the inside and outside surfaces to eliminate ec- 
centricity and produce a uniform wall thickness. Short lengths of 
*/,-in. OD tubing are welded to the closure heads to support the 
specimen in the center of the cylindrical furnace. A !/s-in. OD 
tube attached to one of the support tubes connects the specimen 
to the pressurizing system. Five Chromel-Alumel thermo- 
couples are located along the specimen to provide a basis for con- 
trolling the temperature of the specimen and the temperature dis- 
tribution along its length. 

Apparatus. The tubular stress-rupture apparatus is illustrated 
in Figs. 2 and 3. It contains twelve separate stations where 
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tubular specimens are stressed with internal steam pressure in a 
controlled temperature furnace until a stress-rupture failure 
results. Located in the front section of the apparatus are the 
These units 
control the saturable reactors that supply continuous power to 
the constant temperature furnaces. ' 


instrument panel and the individual power panels. 


The furnaces are positioned horizontally in their insulated 
cubicles at the rear of the apparatus. A sectional view, Fig. 4, 
shows a tubular specimen, with its instrumentation, installed in 
its furnace ready for testing. The electric furnace, with its 
separately controlled zones, will heat the specimen evenly over 
the central ten inches of the specimen, maintaining a maximum 
temperature gradient over this length of +4 F and holding tem- 
perature variations at any point within the gage length to +3 F. 
An insulated guard panel closes the front of the cubicle and 
causes the force and fragments of rupture to be directed toward 
the back of the apparatus where heavy deflector plates eliminate 
any hazard to operating personnel. 

The rear section of the apparatus also contains the high pres- 
sure circuits used to pressurize the specimens to the desired stress 
level. As shown on Fig. 5, each circuit contains a high-pressure 
reservoir which is held at a constant temperature (220 F) by its 
electrically heated low-pressure water jacket. The reservoir, 
which is the major volume of each circuit, utilizes its elastic dila- 
tion and the compressibility of the water to minimize the effect 
of specimen creep and small temperature fluctuations on the 
specimen loading pressure. Internal specimen pressures, as in- 
dicated by the gages at the sides of the apparatus, are main- 
tained within +25 psig by periodic additions of make-up water 
from an air-driven water pump. 

All of the tension bar specimens were tested with standard 
stress-rupture equipment in which the specimens are stressed at 
elevated temperature by a dead weight lever system until rupture 
occurs. 

Test Procedure. Since there is some eccentricity in most com- 
mercial seamless tubing, it becomes necessary to establish some 
method of describing the dimensions of the specimens. There- 
fore, before the end closures are welded on a specimen, its initial 
dimensions are measured with a micrometer. Outside diameters 
are measured on 90-degree planes at 2'/;-in. intervals to detect 
any large irregularities along the specimen. If large irregularities 
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Table 4 Tubular stress-rupture properties of thin-wall Croloy 2'/, tubing at 1100 F, heat No. 924 


Specimen Actual Tube Dimensions Calculated Testing Conditions 
No, Dimensions 


Stamped End 


Opposite End 


Min. Max. o.D. Min, Max. Avg. Min, Inter. Hoop* Lamé to Elong. 
Wall wall Wall Wall I.D. Wall Press Stress Stress Rupture 
In, In, In. In. In. In Psi Psi 


In. 


2.004 + 206 219 2.003 211 215 1.577 .209 5,570 21,100 24, 200 15 26 


1.998 .212 .222 2.000 .213 .229 1.563 .212 4,680 17,200 19,800 51 28 
2.004 -229 1.997 1.565 3,340 12,800 
214 «1.994.218 =. 227 1.556 .216 3, 300 11,900 13,700 580 15 
1.995 1.576 
1.998 1.567 .206 2,600 9,900 11,300 4, 348 15 
7 2.006 .209 .225 2,002 .208 .220 1.571 .208 2,770 10,400 12,000 4,786 14 
2.003 


1.998 
1.999 


3,120 12,000 13,700 858 15 ™ 


1.996 .223 


-223 2.000 1.579 .202 2,800 11,000 12,500 6,422 13 
1.999 1.579 2, 280 


2.006 


8,900 10,200 


*Hoop Stress Calculated By Thin-Tube Formula 


Table 5 Tubular stress-rupture properties of 2'/,-Cr 1-Mo tubing at 1100 F, heat No. H1913L 


Actual Tube Dimensions Calculated 
Dimensions 


Specimen 


Testing Conditions Results 
No. 


Stamped End Opposite End 


Min. Min. Max. Avg. Min. Inter. Hoop* 
Wall Wall Wall Wall I.D. Wall Press Stress Stress Rupture 
In. In In, In. In, In. In. In, Psi Psi Psi Hr. 


2.497 
2.496 


-560 2.497 .560 - 596 1.336 .56 7,470 8,900 13,700 394 30 


2.496 .575 .580 1.336 .574 7,670 8,900 13,900 507 40 
2.495 .571 .588 1.336 .574 6,030 7,000 10,900 1,744 33 
2.495 1.330 
2.494 


2.496 


2.496 


5,600 6,600 10, 300 4,098 38 


2.496 1.333 .572 5,130 6,000 9,300 5,314 26 


1.628 5,300 


2.497 2.496 


*Hoop Stress Calculated By Thin-Tube Formula 


are found, the specimen is rejected. Tube wall thickness is tained at the proper levels throughout the life of the specimen 
measured at each end of the specimen at eight points 45 degrees by making slight adjustments as required. 
apart. From this information, the point of minimum thickness at When a specimen fails, it is allowed to cool and then is removed 
each end is located. The outside diameter at this point and the from the apparatus. Its dimensions are recorded and it is ex- 
opposite wall thickness are then measured and used to calculate amined metallurgically. 

the inside tube diameter. The dimensions of the two ends are Comparison of Stress-Rupture Data. The physical properties, 
averaged to give an average inside diameter and an average mini- chemical analyses, and heat-treatments of the three materials 
mum wall thickness for each specimen. These two average used in these test series are listed in Tables 1 through 3, and have 
dimensions describe a concentric hypothetical tube and are used already been discussed. Tables 4 through 8 list the specimen 


to calculate the internal pressure required to cause rupture dimensions, test conditions, and results of all tests. 
within the desired test period. 


It is evident 
from these summary tables that only certain facts are accurately 
After a specimen is assembled, it is evacuated and filled with known about a tubular stress-rupture specimen; namely, its 
distilled water. The vacuum makes it possible to fill the speci- material, its dimensions, the test temperature, the test pressure, 
men completely with water through the small-diameter pres- and the time for rupture. While these facts are necessary to 
surizing tubing. The specimen and necessary insulation and make good use of the results, one must still answer the important 
guards are then installed in a furnace of the test apparatus. question of how shall the stress in the tube wall be computed. 
Following installation, the specimen is hydrostatically tested All tubes that are subjected to internal pressure at low tem- 
and then its temperature is raised to the desired value. During peratures are known to have a higher hoop stress at the inside 
the temperature-raising period, a ten-foot standpipe is connected surface than at the outside, provided the stresses are kept within 
to the pressurizing system. This arrangement maintains a low _ the elastic limit. The difference between the stresses on the in- 
pressure on the system, which prevents the high-pressure reser- side and outside surfaces is equal to the internal pressure. Plastic 
voir from boiling dry, and allows the water in the specimen to theory also predicts a stress gradient across the tube wall, but 
expand, When the specimen temperature has stabilized, the pres- with the highest stress being on the outer surface. At elevated 
sure is increased slowly by an air-driven pump until the test temperatures, it has often been assumed that, after a short period 
pressure is attained. The temperature and pressure are main- of time, these stress differences become smaller because of higher 
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Table 6 Tubular stress-rupture properties of thick-wall Type 321 tubing at 1200 F, heat No. 20526 
Specimen Actual Tube Dimensions Calculated Testing Conditions Results 
No. Dimensions 
Stamped End Opposite End 

0.D. Min. Max. Min. Max. Avg. Min. Inter. Hoop* 
Wall Wall Wall Wall I.D, Press Stress 

In. In In. In, In In. Psi Psi 
2.003 2.003 -522 15,000 14,100 
2.003 2.003 .506 .528 12,780 12,200 
2.003 2.003 -515 -518 ° 11,120 10,500 
2.003 2.003 -514 -521 10, 490 9,900 
2.003 2.003 .514 .523 9,870 9,300 
2.003 - 526 2.003 : 9,190 8,700 
2.003 2.003 -512 8,580 8,100 


2.003 ° 2.003 .507 .527 7,910 7,600 


*Hoop Stress Calculated By Thin Tube Formula 


Table 7 Tubular stress-rupture properties of Type SA-210 tubing, heat No. T1088 and Type 321 tubing, heat No. 20526 


Actual Tube Dimensions Calculated Testing Conditions 
Dimensions 
Opposite End 


Min. Inter. Hoop® 
Wall Press Stress 
Psi Psi 


19,000 
19,000 
19,000 
19,000 


11,000 
11,000 


° 
N 
< 
a 


16,500 
24, 200 
20, 800 
18,000 
16,600 


*Hoop Stress Calculated By Thin-Tube Formula 


Table 8 Stress-rupture data from tension-bar specimens 


Specimen Diameter, Stress, Hours to Elongation, Specimen Diameter, Stress, Hours to Elongation, 
no in. psi failure per cent no. in. psi failure per cent 
Material: 2'/,Cr 1-Mo, heat No. 924, 1100 F Material: 18-Cr 12-Ni Ti (Type 321), heat No. 20526, 1200 F 

. 252 17000 : 0.322 20000 572 

252 17000 ; 0.322 17000 1517 

254 17000 ¢ < 0.320 16000 2098 

254 15000 : 0.315 14000 3317 

252 15000 0.321 12000 7999 

.253 13000 7 7 

254 13000 ; ‘ Material: SA-210, heat No. T1088, 850 F and 950 F 

253 12000 236: ¢ 321 28000 238 

253 12000 3255 ‘ : 322 23000 1988 
253 10000 717 K .d2) 21500 
253 10500 321 20000 
Material: 2'/,-Cr 1-Mo, heat No. H1913L, 1100 F 321 17000 
1 0.503 15000 121 ‘ 321 15000 


2 0.503 13500 498 321 13500 
3 0.504 11500 4267 : 0.320 12000 


cucce 


coun wooo 
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-0 
.0 
8 22,900 5,972 9.4 
Spec. Results 
No. 
Stam 
Time cir. 
Min. Max. o.D. Min. Max Avg. Lame te Elong. 
Wall Wall Wall Wall I.D, Stress Rupture a 
In. In In. In. In, In. In. Psi Hr. 
1A 1.940 -466 .468 81.940 .466 .467 1.007 .466 850 171390 30,600 220 54 
2B «1.941 .375 1.941 1.191 .375 850 11,9 26, 400 766 37 
«1.941 -249 «34.251 .250 .250 1.441 .249 850 6,570 22, 800 2,221 38 
4p) 1.941 -156 ©.157 .156 .156 1.629 .156 850 3,630 21,000 3,551 43 
6B 1.940 1.187 .377 950 6,980 15,300 1,290 68 
7C 1.941 «83.252 «61.941 .251 .251 1.440 .25 950 3,820 11,000 13,200 2,734 50 
8p 1.941 .156 6.156 «41.941 .156 .157 1.628 .156 950 2,110 11,000 12,100 4,472 52 
9A 1.939 -467 .468 861.940 .467 .468 1.004 .467 1200 13,950 15,000 24,200 646 7 
10B 1.939 -372 «6.374 «61.939 1.193 .373 1200 9,360 15,000 20,800 875 i2 
lic 1.940 -246 .248 1.940 1.446 .246 1200 5,110 15,000 17,900 871 6 
w 12D 1.935 -154 .156 1.935 .155 .156 1.625 .154 1200 2,850 15,000 o4t 5 
1.940 -468 .470 1.9490 .468~ .469 1.003 .468 1200 14,000 15,000 1,663 
& 1.941 ae 1.191 .374 1200 9,430 15,000 S72 iy 
15e -251 .251 1.942 .251 .252 1.439 .251 1200 5,230 15,000 2,215 3 E 
lod 1.942 -156 .157 1.941 .157 .157 1.628 .157 1200 2,880 15,000 1,452 7 j 
6228 69 
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TUBE STRESS CALCULATED BY THIN TUBE FORMULA 
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RUPTURE TIME -HOURS 
Fig.6 Rupture data for 2'/,-Cr 1-Mo tubing; thin tube versus tension bar 


creep rates in the more highly stressed locations. Such an equali- 
zation of stress across a tube wall would permit estimating tube 
life on the basis of an average hoop stress as calculated by the thin 
tube formula 


D 
S= P 
2t 


where 


average hoop stress, psi 

= internal pressure, psig 
inside diameter, inches 
tube wall thickness, inches 


Hoop stresses calculated by this formula are used in this paper 
as a basis for comparing tubular stress-rupture data with bar 
data. 

The first series of tests was made on specimens from two-inch 
tubing with 0.203-in. minimum wall thickness of 2'/;-Cr 1-Mo 
material. The results of these tests at 1100 F are shown in Fig. 6 
along with the results of bar specimens from the same heat of ma- 
terial. Although the tubular specimens failed at a stress slightly 
lower than that of the bar specimens for any given rupture time, 
the agreement is reasonably good. This difference in stress is 
typical for relatively thin-wall tubing and is probably due to a 
basic difference in the effect of creep on the actual stress in the 
material of the two types of specimen. The nominal stress value 
of both specimens is calculated from the load and the dimensions 
of the specimen at the beginning of a test. As creep occurs, the 
load on a bar specimen remains constant and the cross-sectional 
area decreases somewhat, thereby increasing the true stress in 
the material. However, a double effect of creep on a tubular 
specimen takes place, a reduction of wall tnickness or load bearing 
area and an increase in inside diameter which represents the area 
on which the internal pressure acts. This produces a vicious 
circle in which each increment increases the other throughout the 
life of a specimen. 

The next test series was made with specimens from 2!'/2-in. 
tubing with 0.570-in. minimum wall thickness of 2'/,-Cr 1-Mo 
material. The results of these tests at 1100 F are shown in Fig. 7. 
It is evident that the tubular data fall well below the bar data from 
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the same heat of material when the stresses in the tubes are com- 
puted using the thin tube formula. Of special significance is the 
fact that one tubular specimen, which was bored and reamed to a 
0.427-in. wall thickness, failed at a stress higher than would be in- 
dicated by the data of the thicker-wall tubes. It should also be 
noted from the physical properties, chemical analyses, and 
stress-rupture properties of the bar specimens, that this heat of 
2'/,-Cr 1-Mo material is very similar to the previous heat, making 
possible a direct comparison of three specimens with different 
wall thicknesses whose stresses are calculated by the same 
formula. This comparison points to a relationship between wall 
thickness and rupture stress whereby the thickest tube at a given 
stress level has a much shorter rupture life than the thinner 
tubes. 

It seems obvious that the stress-rupture properties of the ma- 
terial itself are not affected to any great extent by variations in 
wall thickness. It follows, therefore, that the assumption of 
average stress in the tube wall is invalid (at least for the tem- 
peratures studied), and that the stress causing failure in the thick- 
wall tubes must be appreciably greater than the average value. 

Better agreement between the tubular and bar data is obtained 
when the maximum hoop stress in the tubular specimens is cal- 
culated using the Lamé formula 


+ D2 
D2 — D2 


S 


m = 
where 


S,, = maximum hoop stress, psi 
P = internal pressure, psig 

D, = outside diameter, inches 
D; = inside diameter, inches 


This comparison is also shown in Fig. 7 for the thick-wall speci- 
mens. The success of this formula in providing better correlation 
between tubular and bar stress-rupture data is thought to be 
purely empirical, since it was derived by applying the elastic 
theory, and stress-rupture is essentially a plastic phenomenon. 
Furthermore, the Lamé formula predicts the maximum stress at 
the inner surface, while an inspection of the specimens after 
failure indicates that the failure, in nearly every case, originated 
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Fig. 8 Rupture data for Type 321 tubing; thick-wall tube versus tension bar 


on the outside surface. Therefore, the Lamé formula is not pre- 
sented necessarily as a recommended method of calculating 
stresses in tubes at high temperatures, but simply to illustrate 
the desirability of using a method of calculating stress which in- 
cludes the effect of a stress gradient across the tube wall and pro- 
duces a higher than average stress value. However, it will be 
shown that regardless of any theoretical reasoning, the Lamé 
formula will give a reasonable correlation between tubular data 
and bar data over a wide range of materials, temperatures, and 
wall thicknesses. 

The third set of data were obtained at 1200 F from two-inch 
tubes with 0.512-in. minimum wall thickness of Type 321 stain- 
less steel (18-Cr 12-Ni Ti). As shown in Fig. 8, the agreement 
between tubular and bar data is again poor when the stresses in 
the tubular specimens are computed by the thin-tube formula. 
The data, although somewhat on the high side, are in better 
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agreement when the tube stresses are calculated according to the 
Lamé formula. 

These results pointed out the need for more carefully con- 
trolled tests that would cover a greater range of diameter-to- 
thickness ratios. The specimens for these tests had acccurately 
machined inside and outside surfaces. They were made of two- 
inch diameter tubes of SA-210 steel, and the wall thicknesses were 
varied from 0.16 to 0.48 in. Test temperatures of 850 and 950 F 
were selected. The test pressures were chosen so that the 
stresses in all specimens were the same, as calculated by the thin- 
tube formula. Therefore, if this method of calculating stress were 
valid, all specimens would rupture in approximately the same 
length of time. As shown in Fig. 9, the specimens did not fail in 
the same time period, the rupture times varying by more than 
200 to one. At each temperature, the specimen with the thickest 
wall failed in the shortest time, and the rupture time increased 
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Fig. 9 Effect of wall thickness on rupture life of SA-210 tubing 
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as the wall thickness decreased. When the stresses in the tubular 
specimens are calculated using the Lamé formula, the data fall 
on a straight line on log-log paper with the slight negative slope 
characteristic of stress-rupture data. These results agree closely 
with those of tension-bar specimens of the same material. 

The results of two series of similar tests on Type 321 stainless 
steel material at 1200 F are shown in Fig. 10. The first set of 
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Fig. 10 Effect of wall thickness on rupture life of Type 321 alloy tubing 


specimens, which were stressed at 15,000 psi by the thin-tube 
formula, all failed short of the bar data with the thickest wall 
specimen rupturing in the shortest time and the thinnest in the 
longest time. It should be noted that the total spread of the 
data is much less than for the steel tubing and that the thinner 
tubes are bunched close together. However, the same general 
tendencies, as mentioned previously, are present but are not so 
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Fig. 11 Correlation of tubular data by dimensionless parameters 


well defined. The second set of these specimens also fell short of 
the bar data and in a random order that is difficult to explain. 
A metailurgical investigation of the tubing from which these 
specimens were made revealed a marked difference in appearance 
and nonhomogeneity of grain size across the wall thickness. The 
machining of various amounts of metal from the inside of the 
tubes to provide the various wall thicknesses could have resulted 
in specimens with somewhat different properties. This may ac- 
count for some of the data scatter experienced. These tests will 
be repeated on Type 304 tubing which has a more uniform grain 
structure and will provide more uniform specimens. 

Although a very large number of both theoretical and empirical 
formulas have been proposed for calculating the behavior of 
tubes under high temperature and pressure, none seems to ac- 
count completely for results which have been obtained experi- 
mentally. Furthermore, since stress-rupture properties of ma- 
terials can be obtained with less expense from bar specimens than 
from tubular specimens, and since most of the existing data were 
obtained with bar specimens, it is desirable to establish a rela- 
tionship between these data and tube performance so that the 
data can be better utilized for design purposes. In an attempt to 
establish such a relationship without resorting to the use of 
formulas, the correlation shown in Fig. 11 was devised. This 
correlation uses two dimensionless parameters, D,/t and S,/P rz, 
to relate tubular and tension bar data. The parameter D,/t, 
which is the ratio of the outside diameter of the tube to wall 
thickness, describes the geometry of the tube. The parameter 
S2,/Pr is, in effect, a stress concentration or loading factor. The 
term Sz, is the stress required to produce failure of a tension-bar 
specimen in a given time. P, is the pressure necessary to cause 
failure of a tubular specimen of the same material and at the 
same temperature in the same given time. It is assumed that the 
stress-rupture properties of a material are the same in both tubu- 
lar and tension-bar specimens and that the stress in the tube 
which causes failure is directly proportional to internal pressure 
and a function of the geometry parameter D,/t. 

The degree of correlation of the data in Fig. 11 indicates that 
these basic assumptions are correct, since the data for all ma- 
terials and all test temperatures fall along a single line. However, 
a considerable amount of scatter is noted in some cases. Some of 
this scatter may be attributed to test inaccuracies and material 
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variations which would normally be expected. It appears, how- 
ever, that some of this apparent scatter may be due to an effect 
of time. For example, the data from the thick-wall tubes of 
Type 321 alloy, all of which have approximately the same D,/t 
ratio, show that the S,/P, ratio values tend to increase as the 
time to rupture increases as indicated: 


Ss/Pr Time to rupture, hr 
1.37 527 
1.33 1393 
1.39 2279 
1.38 ? 3199 
1.48 3683 
1.43 5389 
1.47 6675 
1.50 8972 


The result of this tendency may be that a precise comparison of 
data from the two types of specimens is not possible. Further 
investigation of this question is needed. 

It is well to realize that the test condition imposed on these 
tubes at constant temperature and pressure does not represent all 
possible conditions of service. The axial stress was caused by 
internal pressure alone. In many designs tubes act not only as 
pressure containers but also as structural parts, and the axial 
stress may approach or even exceed the hoop stress. The effect 
of larger axial loads on the stress-rupture properties of tubes is 
another area that requires further investigation. 

Still another such area is the effect of heat transfer on stress- 
rupture life of tubes. Most tubes are used as heat transfer sur- 
faces and, therefore, have a temperature differential between in- 
side and outside surfaces. As a result, thermal stresses must be 
added to the stresses caused by pressure, and the properties of 
the material will vary through the tube wall because of the tem- 
perature variation. Whether or not these phenomena will have 
any significant effect on tube life can only be determined ex- 
perimentally, and we hope to investigate these questions in the 
near future. 


Summary 


The foregoing stress-rupture tests, although not including the 
effects of heat transfer and structural loading, permit the follow- 
ing observations to be made: 


1 The variation in rupture time for tubes of the same material 
over a range of wall thicknesses indicates that the assumption of 
average stress across the tube wall is invalid for thick-wall tubes 
at least for the temperature tested. 

2 The average stress assumption is satisfactory for rela- 
tively thin-wall tubes where the effect of stress gradient across the 
tube wall is negligible. 

3 The Lamé formula for maximum stress at the inside surface 
seems to correlate the various materials over a range of wall 
thickness and temperature. This formula should be considered 
empirical as the assumptions under which it was derived are no 
longer valid. 

4 The dimensionless parameter S,/P7 and D,/t correlate all 
the results to a linear relationship with some evidence that the 
scatter shown is time dependent. 
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DISCUSSION 
W. L. Harding? 


The authors are to be commended for this presentation of a 
considerable amount of stress-rupture data on tubular specimens. 
It is particularly valuable to have the detailed test measurements 
reported in such a form that a variety of possible correlations 
can be investigated. 

The writer’s company agrees with the authors’ belief in the 
value of testing specimens of tubular form, and has such a pro- 
gram in progress. Tubular stress-rupture equipment is in many 
respects similar to that described in this paper, but differs in 
that inert gas rather than steam is used as the pressurizing me- 
dium. Results from this testing are not yet complete enough to 
be included in this discussion. What data are available show a 
trend in general agreement with that of the authors, and indicate 
that presently used correlations of tubular rupture with uniaxial 
rupture tests may become less satisfactory as the tube thickness- 
to-diameter ratio increases. The authors have done valuable 
service in calling attention to the uncertainties in this area. The 
situation emphasizes the need for much more test data on thick- 
walled tubes. In view of the scatter of the present data it is 
felt that effort to develop a revised formula or correlation would 
be premature. When more data are available it is suggested 
that the various analyses described by Burrows, Michel, and 
Rankin* may be a useful starting point for consideration of a 
modified formula, if the additional data show that a change is 
needed. 

The authors’ Fig. 11 plots the ratios (S/P) versus (D,/t), 
and the straight line they draw from these data is given approxi- 
mately by the expression 


(S/P) = 0.545 (D,/t) — 0.73 
A trial of the widely used expression for pipe wall thickness, 
S/P = 0.5 (D,/t) — 0.4 


gives a line which appears to fit the plotted points about as well. 
The disturbing aspect of Fig. 11 from the designer’s standpoint 
is the wide spread in the (S/P) ratio for the thick-walled tubes. 
For the relatively thin-walled tubes, of D,/t of 9 or 10, the range 
from highest. to lowest (S/P) is only about 20 per cent, which is a 
tolerable scatter. For thick-walled tubes of D,/t of 4 to 4.5, 
however, the highest (S/P) is about twice the lowest. 

Cause of this amount of scatter needs to be resolved by further 
testing before any simple relation between (S/P) and (D,/t) 
can be considered to be proved adequate for thick-walled tubes. 


J. F. Traexler* and P. C. Warner‘ 


The authors’ idea of correlating creep-rupture data obtained 
from tubular specimens with data obtained from uniaxial creep- 
rupture tests is of great practical interest since it indicates one 
way of reducing creep-rupture testing expense and complexity 
while at the same time insuring adequate tube or cylinder design. 
A successful correlation and understanding would, to a large 
degree, eliminate the need for the more complicated tubular 
testing in so far as tube or cylinder design is concerned 

Since there is as yet no generally accepted fracture theory, 
the question as to what stress, or combination of stresses, should 
be considered when correlating fracture data is an important one. 


? Research and Product Development, Combustion Engineering, 
Inc., New York, N. Y. Mem. ASME. 

* W. R. Burrows, R. Michel, and A. W. Rankin, ‘‘A Wall-Thickness 
Formula for High-Pressure High-Temperature Piping,” TRans. 
ASME, vol. 76, 1954, pp. 427-444. 

Development Engineering, Steam Division, Westinghouse Elec- 
tric Corporation, Lester, Pa. Assoc. Mem. ASME. 
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It is somewhat unfortunate that the authors have chosen to 
consider only the maximum principal stress to be of importance, 
and have thereby eliminated the important triaxiality effect. 
This might have been included, for example, by considering 
both the maximum principal stress and the spherical or hydro- 
static component of the stress tensor. The general trend in 
such a case would be as shown in Fig. 12. 


1, ARE PRINCIPAL STRESSES 


RUPTURE 
Fig. 12 


The authors’ tests may be grouped roughly into two classes; 
one in which the geometry remained fixed and the stress level 
varied, and the second in which the geometry was varied and the 
membrane stress level remained fixed. In both types of tests, 
the authors’ claim that, if the maximum principal stress in the 
tubular test is calculated from the known internal pressure and 
geometry using an elastic analysis, better correlation is obtained 
with uniaxial data than if a membrane analysis is made. It 
is again unfortunate that the authors have limited themselves 
to either a membrane or an elastic analysis and make no mention 
of a plastic analysis, which is at least theoretically correct for 
the loadings and temperatures involved. 

The writers have made such an analysis for the authors’ 
tests on type 321 tubing assuming a nonwork-hardening material. 
Fortunately, the actual stress distribution is not too sensitive to 
this assumption and small error is introduced. A further 
argument for the plastic analysis is that it not only apparently 
predicts the correct magnitude of the peak principal stress but 
also predicts the proper type of distribution. Whereas the peak 
elastic tangential stress occurs at the bore, the peak plastic 
tangential stress occurs at the outside, and the authors have 
pointed out that all failures originated at the outside. The 
results are shown in Figs. 13 and 14 which correspond respec- 
tively to Figs. 8 and 10 in the paper. Fig. 13 shows the results of 
tests on tubes of the same geometry with various internal pres- 
sures. In all cases the curves signify calculated stresses. The 
uniaxial data are shown for purposes of comparison. 

In assessing Fig. 13, it is obvious that the plastic analysis 
gives much better agreement with the uniaxial data than either 
the elastic or the membrane analysis. The question as to whether 
the plastic analysis gives a perfect fit within the limits of experi- 
mental error cannot be answered however, since in the present 
case there is no objective way to assess the experimental error. 
This suggests that taking cognizance of some of the more ele- 
mentary aspects of experimental design from a statistical stand- 
point would greatly enhance the utility of the otherwise very 
careful experimental work done by the authors. For example, 
if, instead of testing at eight pressures, the authors had tested 
two specimens at each of four pressures, independent estimates 
of the experimental variability and lack of fit of the data to any 
model could be made. By comparing one variability with the 
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other, it would be possible to conclude whether or not any particu- 
lar model was consistent with the data. Fig. 14 shows the 
results of tests on tubes of various geometries but with the same 
membrane stress level. Here, the correlation is decidedly poorer 
than for the previous tests, but again the plastic analysis agrees 
more closely with the uniaxial data than does the elastic analysis. 
The authors have pointed out that machining may have caused 
changes in material properties and that specimens used in these 
latter tests showed marked nonhomogeneity in so far as grain 
size was concerned. This may be a part of the reason for the 
poor correlation in these cases. 

The writers have also studied the authors’ Fig. 11 with some 
interest and have plotted the theoretical curves obtained from 
membrane, elastic, and plastic analyses in Fig. 15. Assuming, 
as the authors did, that perfect correlation exists (i.e., 77 = Sg) 
the plastic analysis gives the following expression: 

i 


D,/t — 2 
It can be seen that this curve gives a good fit to the data and at 

the same time has theoretical significance. 
A perusal of the curves in Fig. 15 indicates that, if part of the 
reason for the tests is to choose among elastic, membrane, and 
plastic-stress analyses, the testing should be concentrated toward 


the thick-walled tube end of the diagram in order that the theories 
give results sufficiently different to enable a choice to be made 
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experimentally. 
reasonable extrapolation of results or the answering of some other 
like question, experimental-design techniques would permit an 
optimum design of the experiments for the chosen purposes and 
the economics of the case. No less important, they permit an 
objective assessment of the chance of drawing erroneous con- 
clusions from the data. 


Further, if the purpose of testing included a 


‘A. B. Wilder’ 


The authors have presented important high-temperature data 
with specific reference to the behavior of small diameter heavy 
wall tubing. Due to higher steam temperatures and pressures 
there has been an increased use of heavy wall tubing in boilers. 
It is our understanding that most of this heavy wall tubing 
is used under Section I of the ASME Boiler and Pressure Vessel 
Code. The thin-tube formula used by the authors is similar to 
the Boiler Code formula. The Lamé formula provides for 
heavier wall material and the results of the authors indicate this 
formula should be used. Data of the type presented by the 
authors and additional information concerning various heats of 
the same grade and different grades of steel should be available 
from various laboratories before definite conclusions are formed. 
We would appreciate comments of the authors regarding the 
use of heavier wall tubing in the ASME Boiler Code. 

Has consideration been given to determination of the creep 
rate under conditions of the tubular creep-rupture test? Al- 
though tubes fail by rupture, their behavior before rupture in 
comparison with bar tests would be of interest. 

The authors have indicated that in the creep-rupture tests of 
tubing the inside diameter increases and the wall thickness 
decreases as the tube expands during testing. In burst testing 
large diameter pipe 24 to 36 in. OD and 40 ft long at ambient 
temperatures, it has been observed that for 75,000-psi ultimate 
strength material the bursting strength as calculated by Barlow’s 
formula is approximately 5,000 psi less than the transverse 
tensile strength. This has been associated in part with the in- 
crease in diameter of the pipe as we are in the plastic rather than 
elastic range during the latter stages of burst testing. The wall 
thickness decreases during the test and this results in work- 
hardening which would be in the direction of increasing the 
strength properties of the material. Comments of the authors 
regarding the possibilities of work-hardening or change in the 
microstructure of the material during the reduction of wall 
thickness at elevated temperatures would be of interest. 


5 Chief Metallurgist, National Tube Division, United States Steel 
Corporation, Pittsburgh, Pa. 
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Authors’ Closure 


The authors wish to thank the discussers for their thoughtful 
comments and their valuable suggestions. The questions 
raised will be discussed in the order presented. 

The participation of Mr. W. L. Harding’s company in testing 
tubular specimens is very welcome indeed, because the additional 
data will help to definitely establish trends and reduce uncer- 
tainties due to scatter inherent in testing materials properties. 
Their results, though incomplete, support the authors’ position 
that the presently used correlation between bar data and tubular 
data may not be satisfactory for thick-wall tubes. This problem 
is now being reviewed by an ASME code committee. 

The authors agree with Mr. Harding’s comment that data 
scatter in the plot of ratios of S,/P, versus D,/t, particularly in 
the thick wall range, leaves much to be desired. However, the 
curve was drawn principally through the crossed points for the 
SA-210 specimens whose dimensions were accurately machined 
inside and out. A recent set of data, obtained on accurately 
machined specimens of Type 304 alloy, also fits the plotted curve 
closely. These confirming results point to the possibility that 
some of the scatter shown may be due to the use of unmachined 
specimens. Nevertheless, the authors agree that this scatter 
needs to be reduced or explained before this simple relationship 
can be used for thick-wall tube design. 

The authors wish to express their appreciation to Messrs. 
Traexler and Warner for their discussion of the plastic analysis 
we referred to in our paper. They have stated the problem ex- 
actly when they question what stress or combination of stresses 
should be used when correlating fracture data, especially since 
there is no generally accepted fracture theory. Since this state- 
ment describes the present state of the art, any procedure which 
correlates the test data without taking into account strain hard- 
ening, creep, relaxation, wall thinning, differential diametral 
growth, increased pressure area, combined stress conditions 
that vary across the tube wall, varying creep rate with varying 
combined stress conditions, initial elastic and plastic deformation, 
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oxidation, metallurgical changes, etc., must be considered empiri- 
cal. A satisfactory theoretical analysis of the many variables 
affecting stress-rupture life in a tube may never be attained. 
However, a number of empirical relationships may be set up 
which will be adequate for tube design. The good agreement 
shown by the discussers in their plastic analysis of the Type 321 
tubing must be considered an empirical relationship because of 
the reasons mentioned in the foregoing. This same analysis, 
when applied to the thick-wall 2'/,-Cr 1-Mo specimens gives a 
poorer correlation. These qualifications also apply to the elastic 
analysis presented by the authors and it remains for future cor- 
relations of test data to determine whether either of the proce- 
dures mentioned, or perhaps some other, will be best suited for 
tube design. 

The suggested use of a statistical approach for setting up the 
test programs is a useful one as it permits an optimum design 
of the experiment. However, when only eight specimens of a 
given material are to be tested, the advantage of determining a 
probable experimental error by testing two specimens each at 
four test conditions may be overshadowed by testing each speci- 
men at a different stress level and drawing a least squares curve 
through the eight points. The slope of a stress-rupture curve 
is better defined by eight points than by four. If a larger number 
of specimens are involved, the statistical procedure suggested 
‘an be used to an advantage. 

With respect to Mr. Wilder’s question regarding the use of 
heavier wall tubing in the ASME Boiler Code, our results indi- 
cate a smaller factor of safety for thick-wall tubes than for thin 
wall tubes when using the code formula. A code committee 
has been set up to review the data pertaining to this problem. 

Creep rates have not been determined under the conditions of 
the tubular stress-rupture tests. However, two tests were made 
by periodically measuring at room temperature the diametral 
growth occurring in a specimen during a cyclic stress-rupture test. 

The authors feel that work-hardening and changes in micro- 
structure occur during stress-rupture testing with a resulting 
effect on rupture life. 
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Design of Pressurized Cylinders for 


J. F. TRAEXLER 


Development Engineering, 
Westinghouse Electric Corporation, 
Lester, Pa. Assoc. Mem. ASME 


High-Temperature Applications 


General equations for the stresses in a thick-walled cylinder in a state of plane strain 
are derived considering ‘‘steady-state’’ creep. 


A specific form of the creep-rate function 


is assumed and numerical examples are included to show the effect of geometry and 


material properties. 


Introduction 


a EVER-INCREASING USE of higher pressures and 
temperatures in the power-generation field has led to a renewal 
of interest in the mechanical design of components subjected to 
creep. One commonly occurring problem concerns the design of 
thick-walled pressurized cylinders carrying radial temperature 
gradients where the stresses and temperatures are such as to cause 
creep. Clearly an elastic analysis is meaningless in such cases. 
This paper will present a stress analysis for such a case assuming 
the cylinder to be fully plastic and the pressure and thermal load- 
ing to be independent of time. 

The first problem confronting the design engineer deals with 
the choice of a suitable failure criterion. In some cases, compo- 
nents fail because of excessive distortion, whereas for other appli- 
cations fracture itself constitutes failure. The choice of a failure 
criterion obviously depends on the application of the component 
and the consequences of failure. There is however one feature 
common to both of these failure criteria. Each requires a de- 
tailed knowledge of the stresses before any conclusions as to the 
adequacy of a design can be reached. Consequently, this paper 
will deal in the main with the problem of determining stresses 
as functions of the space co-ordinates, with only a brief discussion 
of the procedure for applying either of the failure criteria. 

Simply stated, the method will consist of extending the funda- 
mental equations of plasticity to cover the creep problem, recog- 
nizing that creep is both temperature and stress dependent.! 
The derivations will be general at first and specific engineering 
formulas will be obtained from the more general solutions. 
Numerical examples will be included to show the effect of ma- 
terial properties, temperature gradients, and geometry on the 
stress distribution. 


'C. D. Weir, “The Creep of Thick Tubes Under Internal Pres- 
sure,”’ Journal of Applied Mechanics, vol. 24, Trans. ASME, vol. 
79, 1957, pp. 464-466. 

Contributed by the Metals Engineering Division and presented 
at the Metals Engineering Conference, Albany, N. Y , April 20-May 
3, 1959, of Toe American Society oF MECHANICAL ENGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those 
of the Society. Manuscript received at ASME Headquarters, Febru- 
ary 24, 1959. Paper No. 59—Met-8. 


Nomenclature 


Discussion 


The problem to be considered deals with a heavy-walled cylin- 
der in a state of generalized plane strain (€, = const). It will be 
assumed that elastic strains are negligible in comparison to plastic 
strains, thus allowing the use of the Saint Venant equations of 
plastic flow in place of the more rigorously correct equations 
of Reuss. Furthermore, it will be assumed that only long-time 
problems are of interest, thus allowing the use of a constant creep 
rate. In other words, second-stage creep will be assumed to pre- 
dominate. Triaxia] stress states will be reduced to equivalent 
uniaxial states by means of the distortion strain energy theory of 
von Mises-Hencky. 

A brief outline of the method of solution follows: For greater 
detail the reader is referred to the Appendix. 

The Saint Venant laws are: 


= Vo,,' 


= 90 


WV = Vic., T) 
and in accordance with the distortion strain energy theory 
20 = — a,)* + — o,)* + (o, — (4) 


Equation (1) states the existence of a functional relationship 
between the strain-rate deviator and the stress deviator while (2) 
is the incompressibility relation. Equation (3) specifies that the 
function relating strain rate and stress is dependent on both the 
state of stress (¢.) and the state of temperature (7). 

Making use of these equations and the previously stated as- 
sumptions, we arrive at the following general expressions for the 
radial and tangential stresses 


( 


o, oW dT 
oT dr ) (5) 


strain 

strain rate, 
stress, psi T 
temperature, deg F 
radius, in. a 
inside radius, in. 
outside radius, in. 
internal pressure, psig 


radial displacement, in radial 
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creep-rate function 
creep-rate function at constant 


creep-rate function at constant 


tangential 

axial 

combined or equivalent 
at bore 

at outside 


= material constants 


( )’ = deviator part of tensor 
( )” = spheric: part of tensor 
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d 
a) 
(2) 
where 
+ ( | 
: 
€ = ( = 
«= (),= 
(). = 
r= 
a= 
6 = Superscripts 
P= 
| 


de, WV oT dr 


ov dT 


If one now assumes 
V(o,, T) = (7) 


there is considerable reduction and (5) and (6) become 
om l 
+1 
V3 
— f IndXdo, f In Odo} +C (8) 


o,= V3 {a + In + InA)o, 


deg +0 (9) 


Finally, if one assigns specific forms to @(¢-) and X(7'), as for 
example 


= Ao.” (10) 


X(T) = (11) 


and recalls that for a cylinder with a steady-state radial tempera- 
ture gradient 


T=minr+k (12) 


where 


T, — T,lnb 


k 


a 


the resulting equations are 


i-2+am 
1 
n 


Note that for the isothermal case m = 0 and if, in addition, 
n = 0, we arrive at the isothermal elastic case. 

The form of @(¢,) used in Equation (10) is but one of several 
common in engineering usage. The form of \(7’) chosen in (11) 
is not quite so common. Originally introduced by Bailey? it 
has lost favor in recent years due to the application of chemical 


* KR. W. Bailey, “The Utilization of Creep Test Data in Engineering 
Design,"’ The Institution of Mechanical Engineers, London, England, 
1935, p. 246. 
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rate theory to creep processes. The form of (11) consistent with 
chemical rate theory would be 


MT) = De®? (17) 

Either this form or that in (11) can be made to fit test data 
reasonably well provided that the temperature range is not too 
large. The greatest advantage of the form used in (11) is that 
it provides considerable mathematical simplification and leads 
to final closed form equations. For the range of temperature of 
practical interest, little error is introduced by using it in place of 
(17), and there is a great saving of computational time and 
effort. 

Numerical examples based on Equations (15) and (16) are 
shown in Figs. 1 to 4. In each case the thin cylinder or mem- 
brane approximation is included for purposes of comparison. The 
values of n and @ used are not for any specific material but are 
typical values that might be expected in practice. 

Fig. 1 shows the effect of temperature gradient on the stress 
distribution for a cylinder with fixed geometry and material. 
In all cases the bore temperature is held constant. Note that 
there is a shift of the stress field with the lower temperature and, 
hence, stronger elements at the outside picking up load at the 
expense of the inner fibers. It is this feature that makes double 
wall construction desirable for high-stress, high-temperature ap- 
plications. Note also the closeness 6f the membrane theory, 
especially in the absence of a radial thermal gradient. No ap- 
preciable difference in the radial stress distribution occurs for 
the gradients chosen. 

Fig. 2 shows the effect of n on the stress distribution in a given 
cylinder with a fixed gradient. Higher values of n designate 
lower strain-hardening ability and this results in flatter tangential 
stress distributions. This is to be expected since an n of zero 
corresponds to the elastic case. For high n values, the membrane 
theory improves asan approximation. Higher n values, however, 
mean that there will be less of a shift toward the stronger, cooler 
elements and for an optimum design a balance must be worked 
out between loading, geometry, and material. Again, no ap- 
preciable difference exists in the radial stress distribution for the 
range of n values chosen. 

Fig. 3 shows the effect of a on the stress distribution. Equa- 
tions (15) and (16) show that an increase in a has the same effect 
on the stress distribution as a decrease in n and vice versa. This 
trend is borne out by the curves. 

Fig. 4 shows the effect of a geometry change for a given ma- 
terial and temperature gradient. As may be expected from 
membrane theory considerations, thinner cylinders show higher 
over-all stress levels. 


Conclusions 


General equations for the radial and tangential stresses in a 
thick-walled pressurized cylinder in a state of generalized plane 
strain have been derived. Engineering formulas have been ar- 
rived at by specifying a specific form of the creep-rate function. 
The effects of material properties, geometry, and thermal load- 
ing have been brought out through the use of numerical ex- 
amples. 

Having the complete stress distribution, it is fairly obvious as 
to the procedure to be followed for designs using rupture as a 
failure criterion. Experimentally determined creep to rupture 
data are required for the stress and temperature levels involved. 
From this, an estimate of the service life may be made. 

For designs having specified limits of distortion, an additional 
integration of Saint Venant’s strain-rate equations with respect to 
time is necessary. The geometric definition of strain is then ap- 
plied to compute the actual displacements. 
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Fig. 1 Effect of thermal gradient on stress distribution 
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Fig. 2 Effect of material constant (nm) on stress distribution 
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Fig. 4 Effect of geometry on stress distribution 
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It should be noted that the solution methods of this paper 
depend on an assumed creep-rate function. Additional work is 
required to establish the validity of this assumption especially for 
the general triaxial state of stress. The solution of the present 
paper can be extended to cover transient cases as well as steady 
state but, in general, closed-form solutions will not be possible. 
It is also possible to consider problems with variable creep rates 
but again closed-form solutions will not result. 


APPENDIX 
Derivation of Stress Equations for Steady-State Creep 
For incompressibility 


=0 (18) 


dt 


(19) 


for generalized plane strain 
é,=0 
then 
é = —€, 


the equilibrium equation in polar co-ordinates is 


d 
(ré,) = —€, 


c 
é= 
from the distortion strain-energy theory 
20,2 = (o, — o,)? + (o, — o,)? + (o, — o,)? 
the Saint Venant equations are: 
€é,,'= 
=0 
using this with €, = 0 


+ 


then 


Applying Saint Venant’s law to the tangential directions 


1 


then 


_ 


ry 
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differentiating 


dr ry r? LW? 


3 o, oW dT) 


oT ur 


Fe ov 
oT 


and from the equilibrium equation 


do, 
Oo,=r—+da, 
dr 


= Oo) NT) 


{1 — Indo, 


— \n6do,} +C (Al) 


= Ao? 


MT) = Ber? 


+n)o,+ f acdT} +C 


for steady state 


T=minr+k 


T, — T, Inb 


In 
b 
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do, dr oT dr 
(34) 
finally 
do, 1+ a, ov r OW dT (35 
using the equilibrium equation 
dr 
da 1 o, ld 
or — = — — |= 37 
or 
1 
| as 
—! 4+ = 22 
3 « the geometric definition of strain gives ot 
du du 
r r 
(26) 
(27) vs 
(28) (44) 
29 
b 
(30) 
V3 (46) 
but 
3c 
| 
r= <>) (48) 
33 


2+am 


2k 2+am (49) i+n 2 
AB b L 
Applying L’ Hospitals’ rule 
or 

In b/ 
a. = (50) lim o, = (57) . 
In b/a 

then 2+am 

( I+n )-0 

= — + + f aDe-*™dT} + C (51) Thus showing that the stresses always remain finite. 


DISCUSSION 
a 


v3 
An exponential dependence upon flow criteria may also be 
but D and C are determined from handled in the manner suggested by the author. Assuming 


@r=a (53) = A exp + 5; 
o,=0Gr=b6 

with the temperature distribution given by Equation (12), the 
resulting equation for the equivalent stress is 
2+am 2+am ‘ 
3 P 2+ am 


r b 
(54) 


2+am 


finally 


In this model, very low strain rates are represented by negative 
ea ( a ) = exponentials, which is not physically realistic (the use of a nega- 
b tive exponential is equivalent to having tensile strains being 
caused by compressive stresses). Thus this material model 
and from the equilibrium equation cannot be used unless 


24 am (2 + am) 2 
™+(¢) /G > V3 n ak 
l+nhn r “ 
o,=P (55) 
a\ Author's Closure 
( ) The author wishes to express his appreciation to Dr. Merckx 
for his interest in the paper and his comments on an alternate 
ms form of @(¢,). In view of the form of (7) used in the paper, 
2 + am (“ ) — the form of 6(¢,) used by Dr. Merckx is particularly convenient. 
It is also very gratifying to see some attention paid to the often 
2 2+am (56) neglected question of physical significance of the assumed form 
a\ 't* of flow criteria as distinguished from mathematical convenience. 
= ) This is a point that cannot be emphasized too strongly and should 
always be investigated before applying the results of any analysis. 


note also that since 0 < <i *Fuel Element Design Operation, Hanford Laboratories Opera- 


) tion, General Electric Company, Richland, Wash. Mem. ASME. 
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The Response of a Pressure-Sensing System 


A first-order, first-degree, linear expression is developed to describe the transient be- 
havior of a practical pressure-sensing system. The development is in terms of usual 
fluid-flow parameters. 


An experiment is described whereby the transient behavior is determined for a variety 
of pressure-sensing systems. Experimental uncertainties are determined. 

Comparisons are drawn between several published expressions, the proposed linear 
expression, and the experimental data reported herein, to indicate the validity of the 


proposed expression. 
Selected references to related work are given. 


—— are usually sensed by means of probes 
which consist of surface orifices opening either to small-diameter 
tubing for pneumatic signal transmission, or to diaphragm trans- 
ducers for electrical signal transmission. Only pneumatic 
pressure probes which are connected in turn by other tubing 
(usually flexible, and of a larger diameter) to transducers will be 
considered here. The transducer serves to convert pressure effects 
to some easily measurable signa] such as a mechanical motion or 
an electrical output. Such transducer signals are then interpreted 
back in terms of pressure via fundamental relations (as Ap = 
wh for manometer transducers) or by experimental calibrations 
(as Ap = KAE for diaphragm transducers). 

Often, the orifices and associated tubing are subjected to time- 
varying pressures (e.g., there may be a nonsteady flow, or the 
probe may be moving, perhaps rotated, through a pressure- 
gradient field). Thus, while in the steady state, pressure-sensing 
systems are usually of the nonflow variety, yet in transient, there 
will usually be a flow in the system accompanied, at the least, by 
a flow resistance in the orifices and tubings and, in general, by 
compressible effects within the transducer. As a result, the pres- 
sure signal at the transducer lags the impressed pressure at the 
orifice. 

It is the purpose of this paper to indicate a simple expression 
which may serve to predict the response behavior of a pressure- 


Contributed by the Instruments and Regulators Division and pre- 
sented at the Annual Meeting, Atlantic City, N. J., November 29- 
December 4, 1959, of THe American Society oF MECHANICAL 
I.NGINEERS. 

Nore: Statements and opinions advanced in papers are to be 
understood as individual expressions of their authors and not those of 
the Society. Manuscript received at ASME Headquarters, October 
19, 1959. Paper No. 59—A-289. 


Nomenclature 


sensing system when it is subjected to a variety of time-varying 
pressure signals. The problem will be approached from the fluid- 
mechanics point of view, rather than from the control viewpoint 
(via the Laplace-transform natural-frequency damping-factor 
approach) [1, 2],! and rather than from the viewpoint of wave- 


motion propagation (via acoustical theory) [3, 41. 


Elementary Theory 


A single time constant can describe the response behavior of 
only the simplest of systems. Fortunately, however, it is often 
adequate to distinguish between but two regions in a pressure- 
sensing system. 

A In one of these regions, call it the capillary portion of the 
system, all the fluid resistance is “lumped.’’ (Norte: If the system 
has more than one capillary, an equivalent length may be defined, 
Ref. [5], page 7, as 


d, \* i, 
L, + +...+4,(4) (1) 


where d; is arbitrarily taken as the reference diameter. ) 

B_ In the second region, which will be called the transducer 
portion of the system, all the fluid mass is ‘‘stored.’’ (Note: The 
volume of this transducer portion should include the volume of the 
equivalent capillary.) 

The system, for which a linear expression describing transient 
behavior is to be developed, is illustrated in Fig. 1. 

In the smooth-walled, circular capillary, an axisymmetric, 
laminar flow of a compressible, viscous fluid takes place in tran- 
sient. Discussion will be limited to the manometric type of 
transducer since it represents the more general case (system vol- 


1 Numbers in brackets designate References at end of paper. 


flow area of capillary, power of 


capillary pressure 


axial component of fluid velocity 


variable Pp, = environment pressure at any time in capillary 
A, = equivalent area p; = final environment pressure w = specific weight of manometer 
a, = e7tK p; = initial environment pressure fluid 
D = ID of manometer tube P, = reference pressure : SS 
d, = ID of any capillary p, = transducer pressure at any time 3 finite we — 
E f 5 R = gas constant, result € = uncertainty 
base of natural logarithm any capillary = dynamic viscosity coefficient 
= capillary radius from center to 
h = height of manometer fluid wall Pe fluid density at capillary-trans- 
ducer interface 
K = lumped constants of system, time T = absolute temperature 
constant = time 7 = shear stress 
L, = length of equivalent capillary V = system volume at any time, varia- aa On Ov: Ons 
= length of any capillary ble ozs 
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Fig. 1 Pressure-sensing system te be analyzed 


ume being a function of time). To further preserve generality, 

a distinction is made between environment and reference pres- 

sures although, experimentally, initial environment pressure 

would be stepped most conveniently to the reference pressure. 
Such a system may be characterized [6] by writing a flow bal- 

ance at the capillary-transducer interface (i.e., at z = L,). 
Mass flow rate into transducer = rate of increase of mass in 

transducer 


re dV dp, 
= V 
Pc dt dt 


The flow balance is localized in this manner to simplify the 
analysis, since at this interface the density term has consistent 
meaning. 

Appropriate expressions will now be developed for the various 
terms in equation (2). 

A Four distinct pressures may be noted in the system. 


1 The environment pressure p, undergoes a step change 
with time. Att = 0-, p, = p,;; att = 0*, p, = py. 

2 The capillary pressure, p = f(z, r,t) may be defined in terms 
of its variation within the capillary by writing instantaneous 
force balances on an annular element in both the axial and radial 
directions. 

Axial Direction, Fig. 2(A) 


Net shear force = — (rd@dxr,,)dr dOdxdr (3) 
or or 


Net pressure force = — — (rd@drp)dx ™ d@drdz rp] (4) 
or or 


Assumption: Second-order terms are negligible. 
Now, if the [F, are set equal to zero 

Assumption: Inertia forces are negligible compared with vis- 
cous forces in the ‘axial direction. 


there results (T,.7) (5) 


or 
Radial Direction, Fig. 2(B) 
Net shear force = —(rd@dr) om dz 


Net pressure force = 


~(rd0dz) 2. ay 
or 


Assumption: Second-order terms are negligible. 
Now, if the [F, are set equal to zero 
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A) AXIAL DIRECTION 


(B) RADIAL DIRECTION 
Fig. 2 Various stresses acting on the fivid element 


Assumption: Inertia forces are negligible compared with viscous 
forces in the radial direction. 


there results = (8) 


Nore: Equations (5) and (8), which indicate axial and radial 
equilibrium in the absence of inertia and body forces, may be 
recognized as forms of the Navier-Stokes Equations in cylindrical 
co-ordinates with axial symmetry assumed, [7] p. 49, [8] p. 37. 
Also, by taking moments about any axis, under equilibrium con- 
ditions, it may be seen, [7] page 41, [8] page 20, that pairs of 
shearing stresses with subscripts which differ only in their 
order are equal, i.e., T,. = Tz, 

3 The transducer pressure, p, = f(t), is the function de- 
sired. Initial and final conditions are as follows: 


att = 0, Pi = Di 


Nore: Transducer dimensions are such that the pressure p, 
may be considered uniform throughout at any time. 

4 The reference pressure p, is a constant and needs no 
further definition. It may be greater than, equal to, or less than 
the transducer pressure. 

B_ The density of the fluid at the capillary-transducer inter- 
face may be defined as 


Ps 
RT f(t) (9) 
Assumption: Relations between thermodynamic quantities 
of fluid are closely approximated by the equation of state of an 


idealized gas. 
The rate of change of density in the transducer is 


(10) 
dt RT dt 
Assumption: Fluid density changes isothermally in transducer. 
C Lumping the volume of the system in the transducer, just 
as resistance of the system was lumped in the capillary, V., is 
assigned to designate the steady-state volume of the entire sys- 
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Fig. 3 Definitions of equivalent area for various manometers 


tem, from orifice to surface of the manometric fluid. The system 
volume at any time is then 


A, 
V= Ve — — (11) 
w 
where A, is an equivalent area dependent on the geometry of the 
manometer used (see Fig. 3). 
The rate of change of volume is 


(12) 


D_ The instantaneous axial component of the velocity in the 
capillary is obtained by separating variables and twice integrating 
equation (5), the expression for the axial pressure gradient in the 
capillary, where, by definition 


Ov 


13 
(13) 


and where, as a direct consequence of equations (8) and (17), 
op/dx # f(r), as indicated in greater detail at a more appropriate 
place in the development. 

There results 


1 
— — Gar t+ 
4u Ox 
Introducing boundary conditions and evaluating 
v = finite atr = 0, 
v = zero atr = 19, 
Assumption: There is no slippage of fluid at the solid bound- 
aries, Ref. [9] p. 168. 
Thus an expression for the instantaneous axial velocity at any 


radius becomes 
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(15) 


Assumption: The time constant of the system is large com- 
pared with the time required for the establishment of a parabolic 
velocity distribution typical of flow in the Hagen-Poiseuille 
regime, Ref. [7] page 66. 

The flow balance (2), may now be written as 


To 

Pe 1 Op 2 2)9 
0? — rd 
ii 


1 A, . A,\ dp, 
RT (>, + (py Py) dt 


At this point in the development it becomes convenient to im- 
pose on the solution the limitation that 


— D)< (17) 


Nore: This limitation does not imply the assumption of an 
incompressible fluid within the system, but only that the fluid 
is essentially “uncompressed’’ within the capillary. Indeed, it 
is just because the fluid within the transducer is compressible 
(i.e., fluid density changes as transducer pressure changes with 
time) that there exists a time delay. Inertia of the manometric 
fluid would likewise contribute to the time lag [10] page 2, but 
all inertia effects have been neglected in this analysis. By limit- 
ing the analysis to those cases where pressure changes are small 
compared with pressure level, nonlinearities,? which must arise 
{11] when density is considered to be a function of pressure, are 
avoided. 

Nonlinearities introduced by end effects (orifices) or flow 
rectification (different flow coefficients in and out of the capillary) 
have also been neglected [12] page 102. 

Assumption: 


(16) 


Nonlinear pressure effects are negligible com- 
pared with the linear pressure drop across the capillary. 
Equation (16) may be reduced to a first-order, first-degree, non- 
linear differential equation, and as such precludes definition of a 
time constant; but a time constant is precisely what is being 
sought. Thus seeking a linearized form of equation (16), is 
just the reason for the introduction of equation (17). For, con- 
cerning pressure level, equation (17) implies that p, which is 
bounded by p, and p,, may be well approximated by its average 


So 


value, i.e., by ; (18) 

But, concerning pressure variations with time, dp,/dt cannot be 
replaced by dp,/dt (which latter of course equals zero), since 
system lag derives from the very fact that p, = f(t). 

Note: Mathematically, equation (17) indicates that (concern- 
ing level) p, may be approximated by either p; or p,; but prac- 
tically a departure will be made from the limitations of equation 
(17) (<is really undefined anyway); and thus pt is a more realis- 
tic approximation to p, (as will be seen in the section on com- 
parisons). 

Making use of the approximation, p, ~ p,, the flow balance as 
given by equation (16) simplifies to 


( 128u ) ar [4 (2: — Py) + dt 

Here it is observed that the imposition of equation (17) on the 


solution is tantamount to staring that Op/dr is a constant with 
respect to z, and that p # f(r). For note that, when the fluid is 


(19) 


2? Nonlinearity here means that effects are not proportional to 
causes. That is, there is no simple straight-line relationship between 
input and output, no matter how the plotting scales are distorted. 
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considered ‘‘uncompressed’’ within the capillary, and when the 
fluid flows laminarly in the axial direction (i.e., v, = vg = 0), 
then continuity (i.e., v;, = 0) insures that ov/dz = 0. Thus, 
v # f(x), and by substituting equation (13) successively in equa- 
tions (5) and (8) then 


and 


Consequently, by separating variables, equation (19) may be 
written as 


dp 
128u 
A d Le 
(2p, — + v.| Pe dx (20) 
w dt 0 


Assumption: Fully developed laminar flow is established be- 
tween points of integration, Ref. [7] page 149. 
Upon integration the first-order, linear expression 


dp, 


1 
(py — Ps) (21) 


is obtained where K lumps the constants of the system, and is de- 
fined mathematically as 


A, 

K = (2p, — py) + Va 22) 
| w Py 

K will now be shown to be the time constant of the pressure- 
sensing system. Equation (21) has the general solution [13] 


Pp, = 4 | (23) 


For the case of a pressure-sensing system subjected to a step- 
change in environment pressure, the initial condition is 


att =0, pp =p, =p; 
and at any time, p, = constant. The general solution reduces to* 
(py — = (py — (24) 


Thus K indicates the time required (after the step change) 
for the environment-transducer pressure difference to be reduced 
to 1/eth of the initial difference. In other words, K is the time 
required for the pressure-sensing system to indicate a pressure 
corresponding to approximately 63.2 per cent of the initial pres- 
sure difference. This is, of course, the meaning reserved for the 
time constant of a first-order linear system so that K equals 
the time constant. Caution: For higher order, variously damped 
systems, the time constant has no such significance, [2], page 8. 
That is, if the damping factor of a second-order system was very 
small, one would expect a decaying oscillatory response, and the 
63 per cent recovery would have little meaning; coriversely, if 
the damping factor was 21, the second-order response would 
appear much the same as a first-order response, and the 63 per 
cent recovery would appear to have significance. For Reynolds 
numbers below 2000, a capillary acts like a highly damped system 
and therefore appears much the same as a first-order, linear 
system. 


* A similar expression is given by H. L. Weissberg in Ref. [14], how- 
ever, his K is defined differently. 


Fig. 4 Several approximations to p; for a step change in environment 
pressure 


The time required to reduce the pressure difference to any ac- 
ceptable value may be predicted from the expression 


Py — Di 
which tabulates as 
a, 05 | 0368 | 02 | 0.1 0.005 0.01 
| o7K | K | 1.6K | 2.3K 3K «4.6K 


It is now possible to evaluate p, and thus to obtain explicit ex- 
pressions for both the time constant (K) and the recovery time 
(t). 

Inserting p, from equation (24) and ¢ from equation (25) into 
equation (18), then 


1 — a, 
= Py — (Ps — Di) (26) 
In 
a, 


Now, since p, asymptotically approaches but can never equal 
p, in any practical time, a cut-off point for a, must arbitrarily be 
decided upon if a realistic p, is to be evaluated (see Fig. 4). For 
simplicity, an a, corresponding to 98 per cent recovery was chosen. 
Under this condition 


(Py — 
= Py — 


where p, represents the average value of p, from time zero to a 
time corresponding to the 98 per cent recovery point, i.e., during 
the greater portion of the transient phenomenon. 

The time constant now becomes 


~ 
md,*p, | w 2 TVe (28) 


and the general expression for recovery time becomes 


128uL, (Py + Ps) + v.| In (2 (29) 


md,*p, w 2 Py — Pe 


Experiment 


An experiment was designed to allow determination of the 
transient behavior for a variety of pressure-sensing systems. The 
over-all test setup is illustrated in Fig. 5. Pressure levels of '/,, 
1/,, 1, and 2 atmospheres were used in this experiment. Cello- 
phane was chosen as the diaphragm material, with the under- 
standing that no appreciable energy was imparted to the fluid by 
the bursting diaphragm, and with the finding that it was econom- 
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ical, safe, and easy to manufacture in the form of gaskets. An 
Exacto knife was electrically actuated by a solenoid to puncture 
the diaphragm. Ref. [15] describes an experiment in which glass 
diaphragms were used. Ref. [16], page 697, describes a “high- 
speed’’ orifice design which was also tested. The micromanome- 
ter was specially designed to allow electrical indication of the 
manometer-fluid surface location. Ref. [17], page 3, mentions use 
of a similar system. The response time was obtained auto 
matically by an electric timer which started counting when the 
knife burst the diaphragm, and stopped when the moving mercury 
column in the micromanometer contacted the micrometer rod 
(preset for the desired per cent recovery). 


Experimental Uncertainty 


The important question as to how the uncertainties of in- 
dividual measurements are propagated into the results will be 
considered now. 

Following Ref. [18], page 3, use will be made of a second-power 
form of the logarithmically differentiated expression relating 
variables and result; i.e., 


R +{ (4 (4: (4, | 


(30) 
where 
€ = uncertainty, or possible value of error 
A, Ar, A, = powers to which respective variables occur in 
expression relating variables and result 
R = result 
V;, V2, V, = variables 


Thus the system will be analyzed using observed uncertainties 
in individual measurements to form conclusions as to the worth 
of experimental answers. Considering equation (29), it is noted 
that, in general, the term formed with V,, far outweighs in im- 
portance the other two terms in the expression. So, in the uncer- 
tainty considerations, this term only will be considered. Further- 
more, the analysis will be limited to the '/s-atmosphere pressure 
level since the greatest experimental uncertainties are encountered 
there. 

All variables are first listed in order of importance, and then 
with the help of equation (30) several uncertainties to be ex- 
pected in the experimental determinations of time are evaluated. 
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Fig. 5 Schematic of experimental set-up 


OME 
FOR PRESSURE LEVEL 


The relative uncertainty in time (€,/t) is found to be approxi- 
mately +6 per cent for any per cent recovery. However, for 
tests on a given geometry, the length, volume, and diameter are 
fixed quantities and the experimental scatter should be much 
less (say +3 per cent) than that predicted by the foregoing 
analysis which concerns uncertainties in measurement level as 
well. 


Comparisons 


Graphical comparisons were made between 1B M-704-computer 
solutions to the following equations. 


= B/p.| Ve + In 
This is the linear equation developed in this paper, and is based on 
~ Dy — (Py — 


A, 
Bir, | Va | In 
w a 


This is a linear equation based on p, ~ py. 


A, 1 
t; = — (2p; — 


w a, 


This is a linear equation based on p, ~ p,. 


1 A 1 
B/p,| Va In — + — (», In — + 3p, 
a,b, w a, 


This is a nonlinear equation based on Equation (3) of [5], page 6. 


1 A, b, 1 
— 4+ — — — 
ab, w ( a, "ab )| 


t 


This is a nonlinear equation based on Equation (8) of [19], page 
14. 


where: 


B = 128uL,/md,*F 
F = 1/((L2/Iy)(di/d:)* + 1) 
Ve = Vit V2+ Vn 
KV, , (K+ 1)V2 
Vi = volume of first capillary 
volume of second capillary 


+ 
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V,, = volume of transducer 
a, = (py — — Pi) 
= (py + Pi)/(Ps — Pr) 

Experimental data obtained in the present work were superim- 
posed on certain of these computer solutions, together with the 
experimental uncertainty envelope, to indicate whether devia- 
tions between the various solutions were significant in view of 
experimental uncertainties. 

The several cases considered include: 


1 Usual capillary-transducer system in which all resistance is 
considered in the capillary and all volume is considered in the 
transducer. 

2 Orifice-capillary-transducer system in which a definite 
nonlinear pressure drop occurs at orifice. 

3 Capillary-capillary-transducer system in which the re- 
sistance and volume are both distributed throughout the 
system. 


Conclusions 


All conclusions are based on the graphs given in Figs. 6 through 
9 and others not illustrated (note that results in Figs. 6,7, and 9 
are plotted on a semilog co-ordinate system to bring out the 
linearity of the proposed equation). 
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LEVELS) 


% DIFFERENCE, ( 


1 From Fig. 6 it is concluded that all equations, linear and 
nonlinear, reliably predict response times for pressure differences 
on the order of 1 per cent of the pressure level. It is further con- 
cluded from this figure that the nonlinear equations are less 
affected by increases in the size of the step change in pressure than 
are the linear equations. Finally, it is noted that the linear 
equation developed in this paper, namely, equation (29), is in 
closest agreement with the nonlinear equations over the transient 
range of most general interest (i.e., from 25 to 99 per cent re- 
covery), no matter what the size of the step change in pressure. 

2 The advantage of the linear over the nonlinear equations is 
that the constant slopes of the linear plots insure that time con- 
stants can be defined, while definition of a time constant is pre- 
cluded by the varying slope of any nonlinear equation. 

3 From Fig. 7, the excellent agreement at various pressure 
levels between experimental points and the linear ‘equation pro- 

posed in this paper is noted. 

4 In Fig. 8, the effect of change in length ae change in 
diameter of the capillary on this time constant of the linearized 
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Fig. 7 Comparison of experiment versus linearized theory at various 
pressure levels 
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Fig.6 Comparison of several expressions for response time after various Fig. 8 Effect of change in length and change in diameter on the time 


step changes in environment pressure 
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Fig. 9 Effect of various orifice geometries on the time constant of the 
linearized expression 


expression is noted. Experimental points are superimposed on 
this plot. Response time is seen to vary directly with length, i.e., 
halving the length halves the response time. Much more im- 
portant is the reduction in response time realized by increasing 
the capillary diameter since time varies inversely as the 4th power 
of the capillary diameter. 

5 In Fig. 9, it is confirmed that the “high-speed’’ orifice of 
Ref. [16] is faster than usual long-passage orifices and that, in 
general, orifices of small lengths compared with capillary lengths 
and of diameters greater than '/, of the capillary diameters in- 
troduce nonlinear pressure drops which are insignificant com- 
pared with the quasi-linear pressure drops across the capillaries 

6 In view of the experimental evidence, and considering the 
good agreement between equation (29) and the more general 
(although more unwieldy) nonlinear equations of Refs. [5] and 
[19], it is recommended that equation (29) be used to represent 
transient response time for pressure-sensing systems because of its 
simplicity, utility, and for the ease with which it may be related 
to the elementary notions of fluid mechanics. 
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and Their Models 


The author rejects, momentarily, actual experimentation with real heat exchangers for 
logical cogitation and arm-chair experiments. Observed as mental images or models, 


these heat exchangers reveal their basic characteristics—characteristics that will be par- 
tially obscured in an actual experiment by secondary effects inherent in the equipment. 
These basic characteristics, once perceived, may be used to correlate by means of frequency- 
response diagrams many results heretofore diverse and seemingly unrelated, that have been 
reported in theliterature. Included among such results are not only the dynamics of vari- 
ous types of heat exchangers but also the thermal dynamic interaction between fluids and 
the confining pipe used for their transport. In addition, the author warns that all too 
frequently the use of the average temperature in heat-exchanger dynamics is based on 


faulty logic. 


of heat between fluids through metal 
walls and its transport down metallic tubes is an ever-recurring 
problem in industry. For many years this problem has been 
solved, giving attention only to the steady-state part of the solu- 
tion. The transient part of the solution has only recently be- 
come of sufficient importance to draw considerable effort in re- 
search. This shift toward more emphasis upon transient be- 
havior has been caused by two factors; namely, the greater appli- 
cation of automatic control which is absolutely dependent upon 
the transient state, and the greater speeds at which some 
heat processes now operate, thus demanding prior knowledge of 
transient behavior. 

Mathematical analysis of the transient state is much more 
complex than the relatively simple analysis required for the steady 
state. Differential equations for the transient state require, in 
addition to the parameters associated with steady-state behavior, a 
new set of parameters connected with dynamic behavior. 
These more complex equations can still be solved in terms of 
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steady-state solutions. But the solutions are of a unique nature. 
They are sinusoids and neither differentiation nor integration is 
able to alter their shape. Joseph Fourier showed us more than a 
hundred years ago that such sinusoidal solutions and periodic 
transients are mutually convertible. It is interesting to note that 
his researches in this connection were also concerned with tran- 
sient heat transfer. 

The frequency-response method of analysis is today the most 
commonly used method for studying the transient behavior of 
heat exchangers. This is because frequency-response data are 
compatible, while transient-response data are not. The over-all 
dynamic behavior of several pieces of equipment can readily be 
synthesized if their frequency-response characteristics are availa- 
ble. This cannot be done as conveniently when only transient- 
response data are given. 

We will therefore emphasize the frequency-response aspects of 
heat-exchangers and pipelines in this paper. We will survey the 
literature for pertinent data and we will compare the data on a 
frequency-response basis as this will make the information im- 
mediately useful to both the control engineer and to the designer 
of heat-exchanger equipment. 

However, comparisons of the dynamic behavior of apparently 
dissimilar pieces of equipment must be made on a rational basis, 
using the most fundamental of the parameters that characterize 
dynamic behavior. Our problem then is first to discover these 
basic parameters out of the multitude which can be written, 


Nomenclature- 


heat-transfer area, sq ft 

parameter, defined by Equation 
(50), 1/ft 

parameter, defined by Equa- 
tion (51), dl 

capacitance of liquid slug in 
tube, Btu/deg F 

capacitance of liquid in shell, 
defined by Equation (25), 
Btu/deg F-ft 

specific heat, Btu/lb-deg F 

ratio, defined by Equation (58), 
ft 

inside tube diameter, ft 

outside tube diameter, ft. 

base of natural logarithms, 
2.718...., dl 
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heat-transfer film coefficient, 
Btu/sec-sq ft-deg F 

V-1 

constant, defined by Equation 
(5), dl 

constant, appearing in Fig. 11, 
deg F 

parameter, defined by Equa- 
tion (53), 1/sec 

parameter, defined by Equa- 
tion (52), 1/sec 

constant in Equation (36), dl 

length, ft 

mass, lb 

an integer, 0, 1, 2, 3,..., dl 

steady average temperatures, 
deg F 


temperature perturbations, deg 


time, sec 

Equation (27), sec 

Equation (30), sec 

pulse duration, defined by 
Equation (31), sec 

over-all coefficient of heat 
transfer, Btu/sec-sq ft-deg F 

steady average flow velocity, 
ft/sec 

flow velocity perturbation, 
ft/sec 

steady average weight flow 
rate, lb/sec 

weight-flow-rate perturbation, 
Ib/see 

(Continued on next page) 


1960 / 489 


pe 
qa, = F 
j= t= 
t. = 
Ke = At = 
C, = = = 
c= 
D= V= 
We = 
d,d, = M = 
d; = n= = 
e= Tots T 
ae 


depending upon the various permutations and combinations 
allowed, with the design and operating variables of our equip- 
ment. Quoting from “How to Hunt a Submarine,’’ by Morse 
and Kimball [1]': “It is important first to obtain an over-all 
quantitative picture of the operation under study. One must 
first see what is similar in operations of a given kind before it will 
be worth-while seeing how they differ from each other. In order 
to make a start in so complex a subject, one must ruthlessly strip 
away details (which can be taken into account later), and arrive 
at a few broad, very approximate ‘constants of the operation.’ 
By studying the variations of these constants, one can then 
perhaps begin to see how to improve the operation. It is well 
to emphasize that these constants which measure the operation 
are useful even though they are extremely approximate .. .”’ 

We will discover these constants of the operation by the 
simplest of analyses which are promised in advance to be neither 
rigorous nor difficult, but nevertheless illuminating. Then we 
will test these constants against data reported in the literature. 
Such data were usually obtained at the expense of considerable 
effort and they should be consulted by the reader wishing greater 
detail in the analysis. Data which appear in the form of time 
or space transients [2] or as polar plots [3, 4] in the literature 
have been converted to frequency-response diagrams on Cartesian 
co-ordinates with logarithmic scales for the purpose of this paper. 

Because we wish to emphasize the dynamic aspects of heat- 
exchange processes we shall ignore the static relationships by 
normalizing the amplitude ratio of our frequency-response 
curves; i.e., we shall only show the dynamic amplitude ratio 
relative to the static or very slow dynamic amplitude ratio as 
unity. The static relationships are today rather well understood 
and need no discussion here. 

Finally, we shall show that the commonly used [5, 6] average 
temperature in the analysis of heat-exchanger dynamics is all 
too often based upon faulty logic—that its use can lead to erro- 
neous conclusions concerning control-system stability. 


Model 1 


Fig. 1 represents a simple heat exchanger. The water in con- 
tact with the steam chest is well stirred assuring a uniform tem- 
perature throughout mass M. Temperatures 7), 72, and 7; 
represent fluctuations from their mean values 7, Tm, and 7's. 
Higher subscript numbers indicate higher mean temperatures. 


1 Numbers in brackets designate References at end of paper. 


Nomenclature 


WATER 
W,+W— 4UA 


STEAN— 


Wo+W 


T, 


Fig. 1 Model 1 


W is a weight-flow perturbation about Wo. Let c represent the 
specific heat of water. U is the over-all coefficient of heat trans- 
fer between the steam and the water above it through the thin 
metal plate of area A. Other simplifying assumptions asso- 
ciated with the model are implied in the equations which follow. 
The frequency response relating sinusoidal fluctuations in 7’; to 
those in 7°, is given by 


Ti(jw) 1 + + jQ 40 
Equation (2) defines r,,. It is the average time required 
M 
= — 2 
Tada Wo ( ) 


by a particle of water to traverse the heat exchanger, hence it is 
also known as the average dwell time. And ©,, is the dimen- 
sionless circular frequency normalized on 7 ,, as defined by 

= Tag (3) 


The frequency response relating 7; and 7 is given by Equa- 
tion (4), 


TX jw) 1 + jwrn 
with zero-frequency amplitude ratio K, defined by 


UA 


UA (5) 


and time constant 7; defined by 


equivalent flow diameter of 


average dwell time, defined by 


= dimensionless 


circular _fre- 


shell-side fluid, ft Equation (2), sec quency, defined by Equation 
w = 3.14159... T, = time constant for fluid inside (8) 
p = weight density, lb/cu ft tube, defined by Equations 
= “sum of” (20) and (52), sec = dimensionless circular _fre- 
T) = equivalent dead time, defined T, = time constant for shell-side quency, defined by Equation 
by Equation (64), sec fluid, defined by Equation (15) or (57) 
7, = time constant, defined by Equa- (21), sec 
tion (6); also by Equation Ta: = dwell time for tube fluid of 2- — a or 
(59); also by Equation (62), pass heat exchanger, defined - quency. defined by Equation 
sec by Equation (22), sec (3) 
T: = time constant, defined by Equa- Ta, = dwell time for shell fluid of 
tion (60); also by Equation 2-pass heat exchanger, de- a a 
(63), sec fined by Equation (23), sec . quency, defined by Equation 
Ts; = time constant, defined by Equa- T, = time constant for metal of tube, (38) 
tion (61), sec defined by Equation (53), 
T, = time constant, defined by Equa- sec 
tion (48), sec w = circular frequency, radians/sec ae 
T4 = dwell time, defined by Equa- Q = dimensionless circular _ fre- t = tube 
, tion (12), sec quency m = metal 
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Fig. 2 Impulse response of Model 1 


Me 


UA 


(6) 


We normalize Equation (4) and write it as 
Tx jw) 1 


T(jw) 1+ 


with 2, the dimensionless circular frequency normalized on 7; as 
defined by 


= rw (8) 


Equations (1), (3), (7), and (8) allow us to describe the dynamic 
behavior of this heat exchanger, with either 7 or 7; as input, 
and 7; as output, by means of a single set of frequency-response 
curves defined by the following: 


AR = (9) 


a+ oy 


Phase angle = —are tan 2 (10) 


where AR stands for normalized amplitude ratio, and Q is 
circular frequency normalized on the appropriate time constant. 

A flow perturbation W corresponds to parametric forcing 
because we effectively perturb the average dwell time of the 
system T,,. However, small flow oscillations W about the 
mean value of 7,, will not appreciably alter the dynamic picture 
from that presented by oscillations in temperature 7’). 

The frequency response of this heat exchanger may also be 
found by Fourier transforming the impulse response. For im- 
pulses applied at 7,, W, or 7’; the response of 7; is shown in Fig. 2. 

A normalized initial response magnitude is again used. Equa- 
tions (9) and (10) may also be derived from the Fourier trans- 
form of the impulse response shown in Fig. 2. 


Model 2 


Fig. 3 is the second model. Water flows with velocity Vo 

through a thin metal tube of length LZ. Steam surrounding the 

‘ tube keeps the metal at a uniform temperature throughout its 

length. The water flowing within the tube has a steady average 

temperature gradient in the direction of flow. Assume that each 

small slug of water inside the tube is perfectly mixed transversely, 
but that no mixing takes place longitudinally. 

We wish to relate sinusoidal fluctuations in steam tempera- 
ture 7’; to outlet-water temperature fluctuations T,. Analysis 
of the problem is complicated by the fact that we must now con- 
cern ourselves with both space and time co-ordinates to describe 
the temperature of the water inside the tube. Hence, partial 
differential equations are called for. This is the normal ap- 
proach. 

But let us simplify the problem of understanding the basic 
dynamics of this heat exchanger. What would happen if we 
applied steam-temperature disturbance 7’; in the form of an im- 
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L TIME — 


° 
Fig. 4 Impulse response of Model 2 


pulse? Fig. 4 shows the idealized response of the outlet tempera~- 
ture change 7;. Fourier transforming this response, or, alter- 
natively, Laplace transforming it and subsequently substituting 
jw, we get the basic frequency-response characteristic of this 
heater relating 7; and T;, 


ll 
Ti(jw) jw 


The dwell time 7, of the liquid within the heat exchanger is 
given by 


L 
Ve (12) 


It is more convenient, for plotting purposes, to reduce Equation 
(11) to the expressions, 


(13) 


WT, 


Phase angle = — = (14), 
Equation (13) has been normalized to zero-frequency amplitude 
ratio of unity. It appears then that half the dwell time is the 
prime parameter relating 7 and 7’; for this type of heat exchanger. 
We therefore normalize the circular frequency on 7,/2 as defined 
by 


(15), 
The normalized amplitude ratio and angle of lag relating steam- 


temperature changes 7; to heated-water fluctuations T; are 
given by 


sin 
AR = — 
R (16) 
Phase angle = —(2, (17) 
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Fig. 5 Frequency response of heat exchangers 


Catheron, Goodhue, and H Steam in shell (steel), water in tubes 
(copper), 6-passes; 4 X 7 and 2 X 6 tubes, 0.75 in. OD X 0.042-in. 
wall, heat-transfer area 31 sq ft, water path 24 ft long, 30 Ib water in 
tubes and head, input was sinusoidal stm press changes, output was 
sinusoidal heated-water temperature changes. 

A, 20 gpm, rz = 10.8 sec, 2 = (5.4 sec) (w rad/sec) 

[, 40 gpm, ra = 5.4 sec, 2 = (2.7 sec) (w rad/sec) 

Lees and Hougen: Steam in shell (steel), water in tubes (copper), 4 
passes; 2 bundles of 11 tubes, 0.75 in. OD XK 18 BWG, heat-transfer 
area 34 sq ft, water path 16 ft long, 30 Ib water in tubes and head, 
input was impulselike change in water flow, output was heated-water 
temperature response. Results were Fourier transformed to obtain 
freq response. ©, 30 gpm, ry = 7.2 sec, 2 = (3.6 sec) (w rad/sec) 
Cohen and Johnson: Steam in shell, water in tube, single pass, 1 in. and 
2-in. concentric brass pipes, 11.6 ft long, heat-transfer area 4 sq ft, 4.3-lb 
water in tube. 

O, 1.44 Ib/sec, rg = 3 sec, Q = (1.5 sec) (w rad/sec) 

Industrial Heat Exchanger: Organic liquid in shell (steel), organic liquid 
in tubes (steel), 184 tubes 0.75 in. OD K 14 BWG X 192 in. long, 
heat-transfer area 575 sq ft; input was sinusoidal changes in tube flow, 
output was sinusoidal changes in tube outlet temperature, heat flow was 
from tube liquid to shell liquid, frequency response includes unknown and 
unavoidable thermowell time constant. 


@, ra = 30 sec, 2 = (15 sec) (w rad/sec) 


Numerical values for Equation (16) are readily available [7]. 
Both Equations (16) and (17) are shown plotted in Fig. 5. This 
figure also shows the remarkable frequency-response correlation 


space. The impulse response 7, will then again be as shown 
in Fig. 4, except for a reversal in sign, a positive impulse pro- 
ducing a temperature drop in T:. But there will be no sign re- 


which exists, through Equation (15), between different heat 
exchangers operating under various conditions. 

Our next concern is the relationship between velocity of flow 
changes V and the output temperature disturbance T,. Again, 
we may apply the concept of the impulse response. A positive 
impulse V will instantaneously displace the water column 
inside the tube a small distance to the right. This will have the 
effect of reducing by the same amount the time which all slugs 
of water in one tube length have to pick up heat. We assume 
that the thermal time constant associated with each slug of water 
is much longer than the dwell time 7, of the heat exchanger. 
This is a fair assumption as the exit-water temperature is nor- 
mally considerably below the steam temperature in the shell 
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versal in case a hot liquid inside the tube is used to transfer heat 
to a well-mixed cooler liquid in the shell space surrounding the 
hot tube. Thus flow changes V and tube-liquid exit tempera- 
ture changes 7; are related by the frequency-response Equa- 
tions (13) and (14), keeping in mind to add 180-deg phase lag 
to Equation (14) if heat transfer is into the tube. The frequency 
response for tube-flow changes may also be found in Fig. 5. 
What effect will temperature changes 7; have upon T,? A 
temperature impulse 7’, will be transmitted down the tube in the 
form of a hot slug of water with velocity Vo, appearing at the exit 
after elapse of the dwell time 7,. We may inject a train of con- 
stant-magnitude impulses at 7), either in rapid sequence or 
slowly. All pulses will appear at 7. with the same magnitude, 
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not necessarily the same magnitude as the injected pulse at 7). 
Thus the magnitude of the frequency-response function is a 
constant, which we normalize to unity. And the phase shift 
is given by 

Phase angle = —Tw (18) 
which, if normalized on half the dwell time, may be written as 
—2Q, (19) 


Phase angle = 


WATER 


Fig.6 Model 3 


WATER 
= ZERO 
[ REFERENCE 

TEMP. 


Another model, that represented by Fig. 6, presents an inter- 
esting problem. Water flows on both sides of the tube which 
has a very thin metal wall. For simplicity assume that the 
tube and shell-side velocities are the same V». The tube 
diameter is d, the weight density and specific heat of water p and 
c, respectively. The over-all coefficient of heat transfer through 
the tube wallis U. The time constant associated with a slug of 
water inside the tube is given by 

dc 


= = (20 
4rd’ 


t 


while the time constant for a shell-side slug of water is 


pc pc 
T,.= = 
4dU 


Here 6 is the equivalent flow diameter of the shell fluid. 
Dwell time of the liquid inside the tube is given by 


2L 
Ta = Ve 
and dwell time of the shell-side liquid by 


Ta, Vo 
Assume that 7, is appreciably smaller than 7 ,,/2. 

We wish to find the dynamic relationship between the tem- 
perature entering the tube 7; and that leaving the shell 7). 
Again, let us apply an imaginary impulse 7); i.e., let us suddenly 
inject a great desl of thermal energy into a small slug of water 
at the entrance of the tube. Immediately, some of this energy 
will be transferred through the tube wall into the shell-side slug 
just emerging from the heat exchanger. Assume that this jump 
in temperature of the shell-side slug is 7; = 100 deg F, at time 
t = 0. The tube-side slug continuously transfers energy to the 
shell-side fluid at a rate given by 

Rete = (24) 

where Co is the capacitance of the tube slug. The energy ab- 
sorbed by the shell-side fluid per unit time is distributed over a 
shell-side fluid length equal to the relative velocity between the 
entering tube-side slug and a shell-side slug; namely, a length of 
shell-side fluid equal to 2 Vo. The capacitance per unit length 
of the shell-side fluid is 
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(25) 


We may now calculate the initial temperature pulse by solving 
Equation (26) for 72: 


(26) 


Next, we calculate the temperature of the hot slug inside the 
tube after a time interval given by 


L 
4 = - 


= T,,/2 
Ve dt 


(27) 
At this time the hot slug has reached the far end of the heat ex- 
changer. During all this time the hot slug was exposed only to 
zero-reference temperature fluid on the shell side. The tempera- 
ture of the hot slug at time ¢, is therefore, 


Ty’ = (28) 
And the temperature of the fluid on the shell side is now Equa- 
tion (29). 


(29) 


After this time the hot slug will reverse the direction of its 
travel and will stay with the cold shell-side slug which has just 
entered. The two slugs, assumed to be of equal longitudinal 
thickness, will now travel to the left in Fig. 6 until they reach the 
outlet of the heat exchanger at the time, 


2L 


= Ve (30) 


= Ta 


Heat transfer between the two slugs as they travel side by side 
will occur at a rate governed by the time constants of the two 
slugs and the decreasing temperature difference between them. 
Solution of the two differential equations, analytically or graphi- 
cally, readily yields the temperatures of both slugs. We are 
interested only in the temperature of the shell-side slug. Let 
it be 7,” at time &, the moment when the hot slug in the tube 
leaves the heat exchanger. 


From Co» we can calculate the pulse duration At. It is 


ope 


(31) 


We now know the impulse response 7';, Fig. 7. 
part of the transient is described by 


The exponential 


at 


TIME — 


Fig. 7 Impulse response of Model 3 
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Co 
= T, = 100 = 
wt, 
Model 3 ; 
= 
WL, an 
é 
| = 
| 
2 
| 
100 
T. 
% 
t, s 
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T, = 100e-"/2%, O0<t<h (32) 


This is caused by spreading the dynamic transfer of energy from 
the hot slug with time constant 7, over shell-side fluid twice the 
length of the heat exchanger. 

To get the frequency response of this heater, relating 7, to 72, 
we take the Fourier transform of the impulse response. The 


result is 
100 (2r,) + jw 
7’ — 27 
—jwAdt 
+ Vo [1 — e~ (33) 

jw 

Now, 


= cos wAt — j sin wAt 
~ 1 — jwAt (34) 


Substituting Equation (34) for the last exponential in Equation 
(33), and using Tz, = 2L/Vo, we get 


—e \?, 


+ at 


100(2 
1 + j2wr, 


(35) 


Our assumption following Equation (23) allows us practically to 
ignore the quantity 


—jwrg 
in comparison to unity. 
The frequency response of the heater is obtained from the unit- 


impulse response. We must therefore divide Equation (35) by 
T; = KAt, the magnitude of the applied pulse. In addition we 


normalize to unity amplitude ratio at zero frequency. The 
frequency response becomes then 
T (jw) 1 + (36) 
€ dt 
T (jw) 1 + j2wr, 


with k a small quantity. Thus, on a logarithmic plot, at low 
frequencies, the first term of Equation (36) dominates, giving us 
the appearance of a first-order time-constant system. At higher 
frequencies, with the first term of Equation (36) smaller, the 
second ‘term makes its appearance upon the logarithmic plot as 
a periodic undulation with peaks at 

= = 0,1,2,... (37) 
Masubuchi [8] does show such peaks for this type of heat ex- 
changer. Note that the undulating nature of the frequency- 
response equation is produced by the second term of Equation 
(36), which arises in the parallel-flow part of the heat exchanger. 
Equation (36), normalized for frequency on twice the tube-slug 
time constant, using Equation (38), becomes 
Q, = 2rw (38) 

Ti(jw) 1 9 


= k (39) 


T(jw) 


Model 4 


Model 4, Fig. 8, demonstrates the dilemma we face when the 
average temperature is used in the analysis of heat-exchanger 
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Fig. 8 Model 4 


dynamics. This model is found frequently in the nuclear- 
reactor-control literature [5, 6]. 

Heat is transferred from the metal plate with area A and uni- 
form temperature 7’,,, + 7',, to the water through film coefficient 
h. A linear temperature gradient is assumed to exist in the water 
between inlet and outlet. 

The analysis, though illogical, usually proceeds as follows: 
The bulk temperature of the water in the heat exchanger is as- 
sumed to be equal to the arithmetic mean of the inlet and exit 
fluid temperatures. We therefore write the heat balance for 


small temperature perturbations about previous steady condi- 
tions, 


d (T:+T 


We wish to find the frequency response between 7; and 7;. For 
this we keep metal-plate temperature perturbation T',, at zero 
and substitute jw for the derivative in Equation (40). Solving 
the resultant equation for the ratio 7:/7T; we obtain the fre- 
quency response, 


Me 
Tx jw) 5” owe —ha 
2Wee + hA 


From Equation (41) we find the amplitude ratio and phase angle 
to be 


(2Wee — hA)? + (wMc)? 
AR = |‘ 9 
+ hA)? + (42) 
M 
Phase angle = —arc tan arc tan 
(43) 


Equations (42) and (43) lead us to conclude that a very high- 
frequency temperature fluctuation at the inlet to the heat ex- 
changer is transmitted to the outlet without attenuation but with 
180-deg phase shift. Thus a sudden increase of inlet tempera- 
ture 7’, will produce an equally sudden decrease of outlet tem- 
perature 7’; equal in magnitude to 7T;. This is hardly a satis- 
factory conclusion. 

We try next a new, but more logical analysis. The assumption 
that the bulk temperature of the water in the heat exchanger is 
equal to the arithmetic mean of the inlet and outlet fluid tem- 
peratures prescribes a model with the bulk temperature rigidly 
fixed to inlet and outlet temperature perturbations. A sudden 
increase of inlet temperature amounting to 7, degrees will there- 
fore without the slightest delay, in fact, immediately, increase 
the bulk temperature by 7/2 degrees. While we may say that 
this is impossible physically, our major assumption has created 
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: 
i” 
; dt 2 ) 
Pia 


a mathematical model which does behave in this manner. But 
perhaps we will find the end result, the relationship between 7; 
and 7'2, to be more agreeable to us than that given by Equations 
(42) and (43). 


Our heat-baiance equation should be written 


T + 


aT 
— WeT: = (Mc) (¥ (44) 


The term 
1/5(d7T/dt) 


shows that the changing inlet temperature, in accordance with 
our basic assumption, must be reflected in a changing bulk tem- 
perature at a rate half that at which 7; is changing. And this 
rate of change of bulk temperature inescapably implies an 
additional rate of heat flow, 


into the heat exchanger which must be accounted for. Finally, 
the incoming and outgoing heat flows must be balanced against 
the heat rate to bulk storage as reflected by the rate of change of 
outlet temperature; namely, 


Equation (44) can be rewritten as 


T: + T; 


= (= (45) 


dt 2 


Equation (45) shows, logically so, that the net heat rate to bulk 
storage is proportional to the rate of change of the temperature 
difference between outlet and inlet temperatures. The net heat 
rate to bulk storage is not proportional to the sum of the rates 
of change of inlet and outlet temperatures, as implied, illogically, 
by Equation (40) 

The frequency response relating 7; to 7;, when derived from 
Equation (45), is given by the following two equations: 


Me 
T(jw) \2We +ha 
+hA 
Phase angle = —are tan pare 
wMe 
tan ————_ (47 
+ are tan — hA (47) 


The amplitude ratio is unchanged from Equation (42). 

Both Equations (40) and (45) give the same correct relation- 
ship between metal-temperature perturbations 7',, and response 
of the outlet temperature 7>. But the frequency response be- 
tween 7; and T; is very different for the two equations. The 
polar diagram in Fig. 9 gives a qualitative illustration of this 
difference. 

Clearly then, in considering stability of control for nuclear 
reactors, one should be aware of the limitations and possibly 
dangerous assumptions connected with the use of this model in a 
careless manner. 
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Fig. 9 Polar diagram for frequency response of Model 4 
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It is interesting to note the time constant 7,, relating T,, to T2: 


2Wee +h 
This is appreciably less than the time constant 7,, Equation (6), 
associated with the mixed fluid of Model 1. Mozley [9], in com- 
paring Models 1 and 4, but as applied to two liquids flowing 
countercurrent, found Model 4 to be in better agreement with 
experimental] data because of the smaller time constant. 


In our first four models we have ignored the heat-capacitance 
effect of metal walls in contact with the flowing fluids. In this, 
our last model, Fig. 10, we intend to consider the dynamic tem- 
perature interaction between a flowing fluid and its metal con- 
tainer. 


To Th To, + T, 


Fig. 10 Model 5. Section through insulated metal pipe. 


We assume the outside of the pipe to be insulated. The pipe 
itself has finite thermal capacitance and negligible thermal re- 
sistance at right angles to its axis, but infinite resistance in the 
direction of the axis. This system was analyzed by Profos [3}. 
A brief explanation of the method of calculation used by Profos 
may be found in the book by Young [10]. A solution is listed 
in reference [10], giving temperature perturbation 7, of a fluid 
at a downstream distance L when the inlet temperature is sub- 
jected to a sinusoidal oscillation of magnitude 7; at circular 
frequency w. Equation (49) is that solution 


T; = sin w — a (49) 
Vo 
K, w? 
= K,? + =) (50) 
KK, 
a = (Aas, + 1) (51) 
(52) 
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K, = 
— 


-+ (53) 

Tm 
Here 7, is the time constant of the fluid and 1,, is the time con- 
stant of the metal wall. Vo is fluid velocity. 

While Equation (49) is not difficult to solve, it is time-con- 
suming to plot various frequency-response curves for dynamic 
studies of systems which include pipes of assorted lengths and 
diameters conveying fluids at various speeds. Is it possible to 
construct approximate frequency-response curves quickly by 
utilizing easily calculated break-frequencies to locate the asymp- 
totes for the logarithmic amplitude-ratio curves? Rizika [2] 
has published valuable data on temperature transients for gaseous 
fluids in pipelines. Can we convert his data to frequency- 
response curves? What major parameter should be selected on 
which to base our correlation? 

Let us again approach this problem from the impulse-response 
point of view. Suppose we start a high-temperature slug of fluid 
down the tube. The impulse response 7; will be approximately 
as shown in Fig. 11. After elapse of the dwell time 7,, the tem- 
perature 7’, will show a pulselike response KA», but of 
smaller magnitude than the initial pulse because of the expo- 
nential transfer of heat to the metal of the tube at a rate governed 
by the time constant 7, associated with the fluid. Down- 
stream temperature 7, after responding to the pulse at time 
T,, Will continue to register a higher temperature for a period 
largely governed by the time constant 7,, of the metal wall 
which determines the exponential rate at which heat is returned 
to the fluid. An exponential decays to 2 per cent of its initial 
value in a period of time equal to four time constants. We con- 
clude therefore that duration of the transient response is from 


T, until t, + 47,,.. The Fourier transform of this impulse re- 
sponse is 


T (jw) [e-%r4 hrm) | (54) 


Because e~‘ is only 0.02 it follows that the magnitude of the 
second term in brackets may be neglected in comparison to the 
first term. Equation (54) is therefore practically equivalent to 


We normalize again to unity amplitude ratio at zero frequency, 
yielding the frequency response, 


T\(jw) 1 


e (56) 


Now we are confronted with the choice of using either metal 
time constant 7,, or dwell time 7, as our basic parameter. We 
suspect that knowledge of their relative magnitudes may be help- 
ful in making this choice. Profos [3] gives us a good example. 
He compares two pipes, each 10 m long. Water flows in one of 


496 1960 


them, at a velocity of 0.5 m per sec. ‘The other pipe carries low- 
pressure steam (1.1 atm abs, 200 C) at a velocity of 34 m per 
sec. The water pipe has an internal diameter of 19.6 mm and 
a 2.5-mm wall thickness. Corresponding dimensions for the 
steam pipe are 59 mm and 7.5 mm, respectively. The film 
coefficients of heat transfer are: For the water system, 2000 k- 
cal/m*-deg C-hr; for the steam system, 150 k cal/m*-deg C-hr. 
Specific heat and density of the steam are 0.5 k cal/kg-deg C and 
0.5 kg/m?, respectively. For the pipe metal these quantities 
are approximately 0.13 k cal/kg-deg C and 7800 kg/m*. Caleu- 
lated time constants are shown in Table 1. 


Table 1 Time constants for steam and water pipes 


Water Steam 
Dwell time, ra, see... ..... 20 0.29 


Table 1 immediately leads us to test 7, for the water pipe and 
T,, for the steam pipe as the characteristic parameters. We shall 
therefore plot the frequency-response data reported by Profos 
for the water pipe in polar co-ordinate form on our rectangular 
co-ordinate system but normalized to unity amplitude ratio at 
zero frequency. For the normalized circular frequency we will 
be consistent with Model 2. The water pipe, then, will be nor- 
malized on the circular frequency defined by 


w (57) 


The filled-in circles in Fig. 12 are the result. At {2 = 1, the char- 
acteristic frequency, we find w = 2/7, = 0.1 rad/see. If the 
water pipe had been a lumped first-order system, as implied by 
the amplitude of Equation (56), we would have seen the char- 
acteristic break frequency at w = 1/7,, ~ 0.2 rad/see. But the 
water pipe is not a lumped first-order system. That is also 
shown by the increase with frequency of the slope of the ampli- 
tude-ratio curve. First, the curve matches the first-order 
dashed curve in Fig. 12 marked 7;. Then, between 2 = 1.5 
and 3 it fits the second-order curve marked 7, + 7». 

The phase angle, to our considerable satisfaction, agrees 
perfectly over the range shown in Fig. 12 with the curve of a 
first-order system plus dead time as described by Equation (56). 
The characteristic frequencies involved are w = 2/T,g = 0.1 
rad/sec, as before, plus the one caused by dead time, or dwell 
time; namely, w = 1/7, = 0.05 rad/sec. Corresponding normal- 
ized frequencies, defined by Equation (57), are 2 = | and Q = 
0.5. 

For the gaslike fluid, steam, we suspect that the small value 
of 7, in comparison with 7,, will make its dynamic effect negli- 
gible. But data provided by Rizika [2] show that a single time 
constant will not suffice for any but the shortest pipe lengths when 
gaseous fluids are involved. However, for short pipe lengths 
T,, is indeed adequate. 

By plotting the transient remainders of the family of transient- 
response curves, published by Rizika, on semilog paper it is 
possible to extract without much difficulty the time constants 
which may be approximately associated with the transients. 
An earlier paper by the author [17] has shown that two or three 
time constants plus dead time can be combined in various pro- 
portions to yield an essentially identical step transient. The 
only requirement, within limits, appears to be that the summation 
of all time constants plus dead time, if any is used, be a constant 
for a particular step transient. With this thought in mind the 
combination of time constants extracted from the Rizika tran- 
sients were plotted as shown in Fig. 13. The dashed lines pro- 
vide a convenient approximate relationship between time con- 
stants and pipe length. The time-constant scale in Fig. 13 is nor- 
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Fig. 12 Frequency response of metal pipes (based on Profos’ data) 
Steam Pipe (steel) 59 mm ID X 7.5-mm wall. 

(1) L-P steam: 1.1 ata, 200 C, 34 m/sec, r, = 214 sec 4, 2.5-m long 
pipe, L/D = 0.835, r; = 303 sec, 2 = (303 sec) (w rad/sec) 

o, 10-m long pipe, L/D = 3.33, 7; = 570 sec, 2 = (570 sec) (w rad/sec) 
9 , 40-m long pipe, L/D = 13.3, 7; = 1640 sec, 2 = (1640 sec) (w rad/- 
sec) 

(2) H-P steam: 100 ata, 400 C, 34 m/sec, r,, = 10.7 sec ©, 50-m long 
pipe, L/D = 3.0, 7; = 26.8 sec, 2 = (26.8 sec) (w rad/sec) 

Water Pipe (steel): 19.6 2.5-mm wall 

Water: 0.5 m/sec, r, = 4.8 sec 

@, 10 m-long pipe, rz = 20 sec, ry = 20 sec, 2 = (r4/2)w = (10 sec)- 
(w rad/sec) 


malized on 7,,, while tie abscissa is a dimensionless pipe distance 
normalized on the quantity D, in feet, also used by Rizika, and 


defined by 


heat-capacitance flow rate Woe 
conductance for heat flow adh 


between fluid and pipe wall, 
per unit length of pipe 
Finally, we choose as our major parameter for the steam pipe the 


major time constant from Fig. 13; namely, 7;, which is a function 
of r,, defined by 


(: + '/, T (59) 


The Profos curves for steam pipes, which include the system 
under discussion, are shown replotted in Fig. 12 after normalizing 
to 7. With Rizika data extending only to a normalized pipe 
length of 5 it becomes necessary to extrapolate these data far 
out to a normalized pipe length of 13 to enable the high-pressure } 
(100 atm abs, 400 C) long (50 m) pipeline data of Profos to be 1 
placed in Fig. 12. 

Figs. 12 and 13 show that more time constants are necessary to NORMALIZED PIPE LENGTH - L/o 


approximate the frequency response of the longer steam lines. Fig. 13 Time constants for gaseous flow in metal pipes (based on Rizika 
These additional time constants were chosen, again approximately _—_ deta) 
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and for simplicity, as simple fractions of the major time constant; 
namely, 


= (60) 
T= 71/6 (61) 


Additional Observations 


DeBolt [11] has tested a heat exchanger with an unusually 
large sinusoidal pneumatic-pressure fluctuation applied to the 
control valve admitting saturated steam to the shell. This 
pressure was varied between 10 and 14 psig, providing a 2-psi 
amplitude. The effective working range of the */,-in. control 
valve was the standard 3 to 15 psig between full closed and full 
open. The heat-transfer area of the heater was 5.4sqft. Water 
flowed through the tubes in three passes, each about 4 ft long. 
Each pass had three °/;-in., 16 BWG admiralty tubes. Saturated 
steam to the control valve was held at 65 psia and 97 per cent 
quality. Condensate level was carefully controlled. Outlet- 
water temperature fluctuations were picked up by a bare thermo- 
pile with negligible time constant and put on a Sanborn re- 
corder. Frequency-response curves obtained by DeBolt at 6 
and 15 gpm, with corresponding dwell times 7, of 4 and 1.6 sec, 
are redrawn, normalized to 7,/2, in Fig. 14. He obtained the 
curve showing undulation at higher frequencies at a flow rate of 
2.4 gpm, with corresponding dwell time 7, of 10 sec. 

Frequency-response tests on the industrial heat exchanger re- 
ported in Fig. 5 with the filled-in circles were performed under 
actual operating conditions, without upsetting the process. 
Data had to be obtained at extremely low signal levels. They 
were recovered from the background “‘noise’’ by machine-correla- 
tion-computation techniques. This accounts for scatter of the 
points. An appreciable and unavoidable unknown thermowell 
time constant explains the deviation from the other points in 
Fig. 5. The data were included because they represent the 
largest dwell time 7, available to the author; namely, 30 sec, ten 
times the smallest shown. This heat exchanger also differed from 
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Fig. 14 Frequency response of a heat exchanger (based on DeBolt data) 


the others reported in that hot liquid inside the tubes was 
used to heat an organic liquid in the shell. 

The Lees and Hougen data shown in Fig. 5, together with 
those discussed in the preceding paragraph, relate tube-exit tem- 
perature fluctuations to tube-flow perturbations. This informa- 
tion confirms our thesis, proposed under Model 2, that dwell 
time 7, may be used as the major characteristic parameter for 
describing these heat exchangers; that is, for relating either tube- 
flow variations or shell-temperature perturbations to tube-outlet- 
temperature fluctuations. 

It would appear that the class of heat exchangers shown in 
Fig. 5 can be described, at least to a first approximation, by 
means of the frequency-response curves in Fig. 15. These 
curves contain a major time constant 7), plus a secondary time 
constant 72, and a dead time 7», all based on the dwell 
time 7 ,, as defined by the following equations: 


(62) 
= (63) 
To = T,/4 (64) 


For example, Catheron, Goodhue, and Hansen [12] report 
equivalent time constants for their heat exchanger and we com- 
pare these in Table 2 with values derived from Equations (62) to 
(64). (Nore: All time constants in Tables 2, 3, and 4 are 
given in seconds.) 


Table 2 Catheron, Goodhue, Hansen data compared 


At 20 gpm 1 tT 

Shell steam-flow perturbation........... 12 4 3 19 

Shell steam-pressure perturbation...... . 2 3 13 
From Equations (62) to (64), with rz = 11 


Another example is provided by the Lees and Hougen [13] data 
with tube-liquid flow changes in a heat exchanger quite similar 
to the one reported in Table 2. Their figures are compared in 
Table 3. 
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Fig. 15 Approximate frequency response of heat exchangers (based upon data in Fig. 5) 


Table 3 Lees and H 
At 30 gpm 


Tube-liquid flow changes.......... 
From Equations (62) to (64), with rg 


gen data compared 


=r 
10.8 


10.8 


T2 


1.41 


To 


1.75 


7.6 


4.2000.... 84 1.8 
Cohen and Johnson [14] used an 11.6-ft-long single-pass con- 
centric-tube heater, made of 1-in. and 2-in. brass pipe, with 


steam on the shell side. Their figures are compared in Table 4. 


Table 4 Cohen and Joh 
Water at 1.44 lb/sec 
Shell steam-flow perturbation (temp 
obtained from sat stm press). 3 
From Equations (62) to (64) with rg = 3 


3 0.75 0.75 4.5 


An example of transient-response correlation is provided by 
Rizika in a second paper [15] in which he computes the transient 
response of a Ross heater No. 812 (8 in. in diameter and 12 ft 
long, consisting of 72 /s-in-OD, No. 18 BWG brass tubes). 
This heater has been designed to heat, in the steady state, 6300 
gal of water per hr by condensing 5200 lb of steam (at 10 psig) 
per hr on the outer surface of the heat-transfer tubes. The dwell 
time under the specified operating conditions is listed by Rizika 
as T, = 5.65 sec. Hence, we would expect our major time con- 
stant to be 7, = 7, = 5.65 sec, with a secondary time constant 
and dead time each equal to 1.41 sec. Rizika calculates his 
transient under the following extreme conditions: Water is al- 
lowed to pass through the tubes at rated flow until the metal of 
the heater is at equilibrium with the cold water. All this time 
no steam is admitted. Suddenly the steam is allowed to flow. 
Rizika’s calculated transient, which shows no dead time and about 
0.5-see secondary time constant may be approximated with a 
maximum-slope line at the point of inflection. This line cuts the 
new steady-state condition line at almost 5 sec and the initial- 
condition line at 0.5 sec. Rizika’s calculated transient-response 
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curve follows the maximum-slope line for 2 sec. To a first ap- 
proximation we represent the Rizika transient by a 4.5-sec major 
time constant plus 0.5-sec secondary time constant, producing 
=r = 5sec. The =r based on 7, and Equations (62) to (64) 
would have given us 5.65 sec plus twice 1.41 sec or 8.5 sec. 
Apparently even such severe test results can be readily predicted 
with reasonable accuracy by using the data and methods provided 
in this paper. 

We add an observation concerning the interaction of hot gases 
with metal walls by quoting Cima and London [16]: ‘Another 
saving feature from a control point of view is that . . . the response 
to a flow step change is essentially complete within a time in- 
terval t = 7,, and this is substantially faster than the response to 
a temperature step change . . . Thus, as step-function changes in 
flow will also be accompanied by step-function changes in inlet 
temperature, one can expect the 50 or more times slower response 
to the inlet temperature changes to control the over-all behavior.” 

For our final observation we cite the point made by Rizika [2] 
in the last paragraph of his paper to the effect that it is entirely 
possible for a heat-exchanger and long-pipe combination to act 
dynamically as if the heat exchanger were not even present. 
We believe that this consideration may well be of importance to 
nuclear-reactor-control engineers especially in those cases where 
shielding necessitates long pipelines between the reactor core 
and the next major point of heat transfer. The data presented 
in this paper should provide a simple and ready means for deter- 
mining when such a difficult control condition is likely to arise. 


Conclusions ; 


1 Basic dynamic characteristics of heat exchangers can be 
found by analysis of very simple mathematical models. 

2 A single basic parameter may be used to describe dynamic 
characteristics of entire families of heat exchangers. 

3 Parameters which may be so used are dwell time of the 
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fluid within the heat exchanger and the metal and fluid time 
constants. 

4 Asingle dynamic parameter easily adjusted to fit numerous 
operating conditions should prove to be helpful in the design, 


purchase, use, classification, and standardization of heat ex- 
changers. 

5 Appreciable amounts of data produced by apparently di- 
verse researches on heat-exchanger dynamics may be correlated 
and presented in more condensed fashion. 

6 The average temperature in heat-exchanger analysis 
should be used with caution. 
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DISCUSSION 
Donald J. Bergman® 


One of the dynamic lags in a heat exchanger results from the 
necessity of measuring outlet temperature (for control) at a point 
fixed by the mechanical design of the exchanger. Ordinarily, 
this is close to the outlet flange. It is obvious that taking the 
measurement at a point 10 ft down the pipe from the exchanger 
outlet, with the heated fluid moving at*a velocity of 5 ft/sec 
adds 2 seconds to this particular time lag. 

In the firing control of oil heaters it was found out back in the 
1930's, mostly by intuition and trial, that inserting the control 


’Chief Engineer, Engineering and Development Department, 
Universal Oil Products Company, Des Plaines, Ill. Mem. ASME. 
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thermocouple perhaps 10 or 12 tubes ahead of the outlet shortened 
the apparent time lag and improved heater outlet temperature 
control. 

It would seem that this particular time lag of a heater or ex- 
changer could be approximated by an estimate of the tube length 
back to the center of gravity of the area between the two tem- 
perature curves and the time required for the fluid to flow from 
this point to the outlet. Location of a control couple at this 
point would probably result in unsatisfactory variations of outlet 
temperature with load, consequently, the optimum location would 
be at some point between this center of gravity and the outlet. 
This leads to the conclusion that, where multiple exchangers 
or multipass units are used, improved control might be obtainet 
by locating a couple between the third and fourth shell, or in- 
serted in the channel ahead of the last pass of tubes. 


C. Concordia‘ and F. P. deMello® 


Correct simulation of transient flow and heat-transfer phe- 
nomena is becoming increasingly important as automatic control 
is applied to processes in which these phenomena play a dominant 
role. The author’s treatment of the subject is an important con- 
tribution in this field. 

We have noted in particular the discussion of Model No. 4 
and the author’s suggestion for use of a better or more logical 
model. We cannot entirely agree with his treatment of this 
case for the following reasons: 

Mathematical relations describing the performance of Model 
4 are partial differential equations in space and time. A feasible 
solution of such partial differential equations can be obtained by 
eliminating the space variable through lumped-constant repre- 
sentation whereby the space is broken up into finite nodes. A 
true representation valid for all frequencies would require an 
infinite number of nodes. However, in most actual problems the 
frequency band of interest is limite 1, and a finite-number-of- 
node representation is adequate. 

The partial differential equation describing the system of 
Model 4 can be written as 


oT oT 
Mc — + cW — hA(T, T) =0 (65) 
ot ox 
where 
M = mass per unit length 
ce = specific heat of fluid 
W = mass flow per unit of time 
T = temperature of fluid (function of x and ¢) 
T,, = temperature of metal (function of z and ¢) 


m 
hA = heat-transfer coefficient per unit of length 


Equation (65) can be approximated by an ordinary differential 
equation relating variables defined at discrete points, xz; and 22, 
with corresponding temperatures of 7; and 7T:. When these 
points are close enough together the differential equation applies 
to an elemental volume for which the average temperature can 
T, + T: 


be approximated as This procedure results in equation 


(40) of the paper, which is commonly used. 
The fallacy is to use only one node to simulate the heat ex- 
changer. Actually with 3 nodes the transfer function would be 


TAP) 
T\(p) 


( Me/3 y 
Ge oc — 2Wee — hA/3 


+ + 


+ hA/3 
* Electric Utility Engineering Section, General Electric Company, 
Schenectady, N. Y. Fellow ASME. 
5 General Electric Company, Schenectady, N. Y. 
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and with n nodes the transfer function would be: 


ow hA (: Me/n y 
TAp) n 2Woe —hAjn”) 


ome n 2Wic + hA/n? 


The form of equation (67) shows that a pseudo transport delay 
is involved since the approximation to a transport lag is 


i-p-— 
n 
n 
When hA/n = 0, i.e., when there is no heat transfer from metal 


to fluid, expression (67) is exactly of the form of expression (68), 
moreover this is the actual case since, with no heat transfer a 


T\(p) 


change in 7’, propagates to T, as a pure transport lag. 

Typical of the finite-number-of-node approximation of a partial 
differential equation in space and time is the so-called ringing 
effect, where the output variable oscillates at a frequency pro- 
portional to the number of nodes. Where this frequency is 
high compared to the frequency of other phenomena of interest 
it does not introduce objectionable errors since it can be recog- 
nized for what it is. 

We were also interested to note that the author dismisses the 
analysis of the steady state as relatively simple. Unfortunately 
this is not always the case. Indeed in many cases an adequate 
analysis of the steady-state performance may require much more 
effort than that required for adding the dynamic features. This 
is because in the design of control systems the required accuracy 
of representation of steady-state phenomena may be considerably 
greater than that of dynamic effects, and it is often necessary 
to consider nonlinear effects. Once the actual modes of operation 
are properly understood, the dynamic effects can sometimes be 
approximated. Even when improvements in performance during 
transients is the primary object of a study, the steady-state 
representation may still have major importance. An example 
occurs in the design of so-called noninteracting controls. 


Arvid Hempel® 


The author’s idea of trying to characterize the dynamic be- 
havior of heat exchangers by a single parameter is a very at- 
tractive one and is well worth investigating. In my opinion, 
however, it seems strange that a liquid-liquid heat exchanger can 
be described by the same transfer function as a steam-liquid heat 
exchanger. In this paper only experimental results from one 
liquid-liquid heat exchanger are presented, and these are those 
which do not fit in very well with the others. It is not clearly 
stated on which parameter these data are normalized; whether 
it is half the dwelltime of the shell flow or the tube flow. Ac- 
cording to the treatment of model 3, one should expect that the 
time constant of the tube fluid would be used. 

I would suggest that the two systems mentioned above should 
be treated separately. 

Concentrating on the steam-liquid system: From a theoretical 
point of view nothing can be said about any time constants in the 
system from knowledge of the dwelltime of the liquid. For 
instance, doubling of the length of the heat exchanger does not 
affect any time constants. However, sound static design of heat 
exchangers requires that the ratio of the dwell time 7, and the 
time constant of the liquid 7, be fairly constant for all heat ex- 


*Chr. Michelsens Institute, Dept. of Applied Physics, Bergen, 
Norway. 
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changers of this type. Within limits it might be possible that 
the time constant associated with the tubes, is related to the 
liquid time constant and therefore also to the dwell time. This 
may be the reason that a treatment based on the dwell time 
perhaps will be adequate in determining the dynamic behavior 
of industrial heat exchangers. According to the foregoing reason- 
ing, one should be careful in selecting experimental results for 
further checking of this theory and not use data from heat ex- 
changers of abnormal static design. 

To illustrate this reservation, I have checked the author’s 
result against frequency response data from an industrial tube 
and shell single pass heat exchanger, with eighteen Cr-Ni tubes 
in the shell. 

Condensing steam was used to heat and boil a solution flowing 
at a rate of 40 dm'/min. The length of the heat exchanger was 
7900 mm, and the dwell time 1.2 min. The result is shown to- 
gether with the author’s Fig. 15. 
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It is seen that the agreement is very poor. This may be ex- 
plained by the fact that, because boiling occurs in the heat ex- 
changer, the ratio r,/7, has not the normal value for the heat 
exchanger. 

The first part of the paper represents a new approach of de- 
riving the transfer functions for heat exchangers. I cannot, 
however, clearly see the relation between this and the results 
obtained in the last part. None of the derived transfer functions 
are of the form found empirically to describe the process. 

Obviously, the important task of this approach is to derive the 
impulse response of the systems. Even if only an approxi- 
mate impulse response of the given model is wanted, it seems to 
me that the author has gone too far with the simplifications in 
deriving the impulse responses for some of his models. This is 
pointed out in the following remarks. 


Model 2 

First the author here has made an unnecessary assumption 
that the temperature distribution of the water is linear. As 
the system is linear, the impulse in the steam temperature may 
be considered to occur when the temperature is uniform through- 
out the system, thus the temperature distribution will not affect 
the result at all. 

The response given in Fig. 4 cannot be correct because, when the 
impulse has disappeared, the water will lose heat at a rate 
governed by the time constant 7,, of the water, until 4 = 7,, then 
it drops to zero. Analytically, this may be expressed 


Tt) = Ulta — 
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where U(r, — t) is the unit step function. 
Thus 


( for 
t) = 
0 for 


And by Laplace transforming this, the transfer function 
1 
—{(s+—)r 
f —e ( oe ‘| is obtained. 


The considerations made concerning the response to the velocity, 
however, seem to be correct. 


$1 


Model 3 


This model is not easy to analyze, and I cannot see that the 
author’s result represents the response of the system with an 
acceptable degree of accuracy. 

For his equation (24) to be correct, the system initially must 
be of uniform temperature. This may well be assumed but it is 
not clearly stated. What is overlooked by the author is that 
the shell fluid, after passing the impulse in the tube, will transfer 
heat back to the water in the tube at a rate governed by the 
shell flow time constant and the temperature of the shell fluid. 
This means that the first part of the response curve is described 
by a function depending on both 7, and 7,. This will affect the 
second part of the response curve as well, because the heat thus 
gained by the tube fluid in the counterflow part is transferred 
back to the shell fluid in the parallel flow part. This will give a 
gradual decrease of the shell fluid outlet temperature after the 
pulselike response given by the author. 

This is the main objection to the response given by the author. 
In addition the author has used a pulse instead of an impulse. 
This is perhaps of minor importance but the effect on the trans- 
fer function could have been mentioned. I also find it strange 
that the author calculates the pulse width from the value of Co, 
which then has to be estimated. It would be more natural to 
calculate Cy from a given pulse width. 


Model 5 


The impulse response of this model, given in Fig. 11, is also a 
simplified one, because actually the heat gained by the fluid 
will depend on the temperature of the fluid as well as the time 
constant 7,,, and the fluid temperature will vary and make the 
calculation complicated. 

However, accepting the exponentially decaying response given, 
it is not necessary to state the duration of the response. In 
fact, if the response is written 


T) = U(t — 


where 
U(t — 7,) is the unit step function 


then Fourier transformation at once gives the equation (55) of 
the author. 


A. L. London’ 


This discusser agrees with the author’s view that a normalized 
time makes for considerable generalization in describing the 
response of heat exchangers. In another paper® this problem is 


7 Professor of Mechanical Engineering, Stanford University, Stan- 
ford, Calif. Mem. ASME 

8A. L. London, F. R. Biancardi, and J. W. Mitchell, ‘‘The Tran- 
sient Response of Gas-Turbine-Plant Heat Exchangers—Regenera- 
tors, Intercoolers, Precoolers, and Ducting.’”’ Trans. ASME, series 
A, Journal of Engineering for Power, vol. 81, 1959, pp. 433-448. 
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considered from the dimensional analysis point of view. These 
results combined with some analog solutions plus some very 
special case analytical solutions lead to the conclusion that 
normalized time @* and normalized wall capacitance C,* in 
combination provide even greater generality. This was not evi- 
dent in the author’s treatment because he deals with a class of 
exchangers (thin metal wall) where the wall capacitance pa- 
rameter C,* approaches zero. Although results are reported® in 
the form of transient-response curves, these parameters, as well 
as the others considered,* will apply also to the frequency response 
as noted by the author in reference [17]. 


Table 2° should prove to be useful in the classification of heat 
exchangers. 


Y. Takahashi® 


The author is to be congratulated for presenting a new, simple 
approach in heat exchanger dynamics fitted for control engineers’ 
use. 

There are many standpoints in dealing with the analysis and 
design of heat exchangers. Also there are so many varieties in 
types, size, constructional details, and operating modes. 

However, the information required by control engineers on 
heat exchangers is rather simple. The transfer function, es- 
pecially the dynamic part of the transfer function, is the in- 
formation they need. The static part of the transfer function, 
the d-c gain, is important but may be found elsewhere, from 
steady-state design calculation, for example. Even when the 
system is nonlinear, which is almost always the case if one 
closely investigates a system, the approximate linear transfer 
function will be the information to be used, although cautiously, 
in dynamic system design. 

As is pointed out by the author, one of the difficulties in deriving 
the transfer function or frequency response is in the complexity of 
the system. Quite often it is a distributed parameter system re- 
quiring partial differential equations to describe the system. 

Here is the beauty of the author’s simple, unified approach. 
It is a thinking experiment, not an actual test. Through the 
thinking experiment one can visualize an impulse response, as 
described in the paper for several models, which in turn may be 
expressed in a mathematical transfer function form. One must 
have intuition and experience in order to eliminate factors of 
secondary importance, leaving only major factors. A factor 

such as heat storage at one place may be neglected in one case, 
but not so in other cases. 


Thus the accuracy of the result may be influenced by the as- 
sumptions one makes in his derivation process. Also, the re- 
quired accuracy, which is dependent upon the problem at hand, 
must be considered. Thus the thinking experiment process itself 
may be regarded as a feedback process including the control 
engineer in the closed loop. Through that feedback process, 
use of the impulse-response principle may yield the frequency 
response with satisfactory accuracy on a band-width of impor- 
tance for system design. 


Author's Closure 


The author thanks the discussers for their interesting and 
valuable contributions further elucidating the problem of heat- 
exchanger dynamics. 

Dr. Bergman’s pertinent observations show that for many 
years dwell time has been recognized as a governing parameter 
in the control of heat exchangers. It is surprising, therefore, 
that normalized time and normalized frequency, both based upon 
dwell time, were not used earlier to unify apparently diverse re- 
searches on the dynamics of heat exchangers. 


* Professor of Mechanical Engineering, University of California, 
Berkeley, Calif. 
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Messrs. Concordia and deMello have shown a line of reasoning 
for arriving at the commonly used Equation (40) of the paper 
which starts out rather logically with a partial differential equa- 
tion. In their development they at least point out some of the 
unsatisfactory features, such as the fallacy of using only a single 
node, described by Equation (40), and the occurrence of the so- 
called ringing effect when a finite number of nodes is used. While 
such an approach is preferable to a derivation of Equation [40) 
based upon the average temperature, the danger remains that 
Equation (40) will be used uncritically. 

Let us remember that Model 4 is supposed to describe a 
nuclear-reactor fuel element. We wish to relate the inlet and out- 
let fluid temperatures. Note that with constant heat generation 
in the fuel element a small increase in inlet temperature will 
eventually be reflected in an equivalent rise of the outlet tem- 
perature, at least to a first approximation. This would not be the 
case in Model 2 with constant steam temperature. For an ade- 
quate description of Model 4 we must therefore have unity ampli- 
tude ratio at zero frequency for the function relating inlet and 
outlet temperatures. The function must also contain a dead time 
or dwell time, and a time constant. The step-transient response 
will then be more realistic, without the misleading “‘reverse-reac- 
tion effect’ associated with Equation (40). A model based 
upon Model 1 plus dead time should therefore provide a more 
satisfactory model for relating fluid temperatures, in better agree- 
ment with observed results, free from such effects as the reverse 
reaction, ringing, and steady-state gain less than unity. 

Messrs. Concordia and de Mello have misunderstood the author 
when they remark: “We have noted . . . the author’s suggestion 
for use of a better or more logical model.” 
but better, definitely not! The author hopes that a careful re- 
reading of his remarks in connection with the “new, but more 
logical analysis’’ will convince them that he has been careful not 
to pass judgment on the merits of the “‘more logical’’ model. 

It is the author’s impression that steady-state analysis is gen- 
erally regarded as being much simpler than transient analysis. 
The greater part of engineering education, for example, is still 
devoted to steady-state analyses, largely because we know so little 
about the transient state. 


More logical, yes; 


In this connection there are those who 
maintain that thermodynamics as now taught is misnamed, that 
it should be called thermostatics. It is not until students have 
acquired maturity and mathematical skill that they are able to 
cope with some aspects of the transient behavior of systems. 
The author is convinced that Messrs. Concordia and deMello 
know this, but that they want to warn the unwary reader that 
even the steady-state analysis can have its pitfalls. And such a 
pitfall is the steady-state gain less than unity for Equation (40) 
relating 7 and 7. 

Mr. Hempel discusses Models 2,3, and 5. Starting with Model 
2, he remarks that it is not clearly stated on which parameter the 
data for the liquid-liquid heat exchanger in Fig. 5 are normalized. 
The author apologizes for this oversight. The reported dwell 
time of 30 seconds refers to the liquid inside the tubes. Boiling of 
the shell-side liquid apparently produced a sufficiently uniform 
shell-side temperature for Model 2 assumptions to be applicable. 
The author’s comments under “Additional Observations” explain 
why the data for the liquid-liquid heat exchanger deviate from the 
other data in Fig. 5 and give the reason for nevertheless including 
them on Fig. 5. 

10 Of interest. is a comment concerning initial negative values made 
by Professor H. M. Paynter on page 242 of “A Palimpsest on the Elec- 
tronic Analog Art,’’ edited by him and printed by George A. Philbrick 
Researches, Inc., Boston, Mass., 1955: “. . . the initial negative 
values . . . [are] characteristic of the method and not of the natural 
stream.’ Fig. 15(d) on page 241 of that publication shows the re- 
verse reaction effect. And we note in passing that the reverse reac- 
tion actually does occur in boiler-level control, and also in airplane 
control, for example. 
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Mr. Hempel warns that one should “. . . not use data from heat 
exchangers of abnormal static design.’”’ If Mr. Hempel means 
that the heat-exchangers reported on under Model 2 are of “ab- 
normal static design,” then a very large number, perhaps the 
majority of heat exchangers, are indeed operated under “abnormal 
static” conditions. By not using data from such heat exchangers 
one would effectively ignore a large class of industrial heat 
exchangers. 

The fact that Mr. Hempel’s heat exchanger with shell-side 
steam boiling the liquid inside the tubes does not fit Model 2 data 
is not surprising because the assumptions of Model 2 do not apply. 
Nevertheless, it is interesting to note that dividing the normalized 
frequency of the boiling heat exchanger by about 5 will shift its 
amplitude-ratio curve into a beautiful fit with the amplitude- 
ratio curve from Fig. 15. And the discrepancy in phase shift 
at the high-frequency tip of the phase curve is then about 30 de- 
grees, a small error in comparison to the basic similarity between 
the two sets of curves. A dwell time about '/; the reported value 
of 1.2 minutes can account for such a shift. At this point, in the 
absence of sufficient information, the author is tempted to specu- 
late as follows: Can the frequency shift be accounted for by an 
effective liquid length within the tubes equal to '/, the tube length, 
with the remainder of the space occupied by vapor? 

The author did not make the assumption that the temperature 
distribution of the water is linear as indeed such an assumption is 
unnecessary. His statement “. . 
gradient. . .”” does not imply that. 

Mr. Hempel states that “The response given in Fig. 4 cannot be 
correct. ..”’. Perhaps Fig. 4 will be more acceptable in connection 
with the author’s sentence, ‘‘We assume that the thermal time 
constant associated with each slug of water is much longer than 
the dwell time. ..”. If, further, it is remembered that only rela- 
tive or normalized magnitudes are wanted, then it will be easy 
to see that Mr. Hempel’s final equation under Model 2 reduces to 
the author’s Equation (11). 


. a steady average temperature 


Mr. Hempel’s equation incorporates 
second-order effects which the author purposely omitted in 
writing his paper. 

Mr. Hempel does not see any acceptable degree of accuracy in 
the analysis of Model 3 which, he admits, is not easy to analyze. 
He must have forgotten the author’s quotation from “How to 
Hunt a Submarine.” The author found gratification in the fact 
that his simple analysis produced insight into the cause of some 
of the peculiarities associated with the dynamic behavior of this 
type of heat exchanger as reported by Masubuchi [8]. For an 
acceptable degree of accuracy the author refers Mr. Hempel to the 
painstaking work done by Masubuchi [8] 

The second sentence after Equation (27) and the sentence after 
Equation (29) imply that Model 3 was initially at uniform tem- 
perature. Mr. Hempel is right; this was not clearly stated. 

Mr. Hempel has overlooked the fact that in a given length of 
time more heat will transfer from the ‘“‘very hot’’ tube slug to the 
shell-side fluid than can be transferred from the not so very hot 
shell-side liquid back to the liquid in the tube behind the hot 
slug. Thus, as a first approximation, to simplify the analysis, 
the secondary heat transfer has been neglected. 

The author used an imaginary impulse, physically not realiza- 
ble, to inject a great deal of energy into a finite slug of water. 
The finite but unknown width of the slug of water transformed the 
initial impulse into a pulse of finite width for the purpose of this 
problem. Finally, the pulse width was related to Cp. Mr. 
Hempel recognizes that the point is of minor importance. 

The author is glad to agree with Mr. Hempel that Equation (55) 
for Model 5 can be obtained simply by Fourier-transforming the 
exponential response and using as the upper limit on the integral 
infinite time in lieu of time equal to rt, + 4r,,. 

The author is in complete agreement with Professor London’s 
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comments and he strongly recommends a careful study of the 
paper referred to in footnote (8). That very useful paper was 
brought to his attention shortly after this paper had been sub- 
mitted for publication. 

Professor Takahashi has ably summarized the fundamental 
concepts upon which the paper is based. The author is grateful 
to Professor Takahashi for restating the objectives of this paper, 
and for his kind remarks, 

Here the author takes the opportunity to thank Mr. W. I. 
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Caldwell for pointing out, in a private communication dated 
December 8, 1959, that the dashed phase-angle curve in Fig. 5, 
marked —Q, should exhibit discontinuous jumps, from 180 to 360 
degrees, again from 540 to 720 deg, and so on. Beyond 180 deg 
the curve marked —{ is therefore not correct in Fig. 5. And 
Equations (14) and (17) apply only to angles between zero and 4 
radians, 27 and 37 radians, 47 and 57 radians, and so on. As 
a result the similarity to the Cohen and Johnson theoretical curve 
is even more striking. 
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